Abstract The dynamic capacity theorem characterizes the reliable communication rates of a quantum
channel when combined with the noiseless resources of classical communication, quantum communication, 
and entanglement. In prior work, we proved the converse part of this theorem by making contact with 
many previous results in the quantum Shannon theory literature. In this work, we prove the theorem with 
an "ab initio" approach, using only the most basic tools in the quantum information theorist's toolkit: 
the Alicki-Fannes' inequality, the chain rule for quantum mutual information, elementary properties of 
quantum entropy, and the quantum data processing inequality. The result is a simplified proof of the 
theorem that should be more accessible to those unfamiliar with the quantum Shannon theory literature. 
We also demonstrate that the "quantum dynamic capacity formula" characterizes the Pareto optimal 
trade-off surface for the full dynamic capacity region. Additivity of this formula reduces the computation 
of the trade-off surface to a tractable, textbook problem in Pareto trade-off analysis, and we prove 
that its additivity holds for the quantum Hadamard channels and the quantum erasure channel. We then 
determine exact expressions for and plot the dynamic capacity region of the quantum dephasing channel, 
an example from the Hadamard class, and the quantum erasure channel. 

PACS 03.67.Hk • 03.67.Pp 



1 Introduction 

Quantum Shannon theory is the study of the transmission capabilities of a noisy resource when a large
number of independent and identically distributed (IID) copies of the resource are available. An important
task in this area of study is to determine how noiseless resources interact with a noisy quantum
channel. That is, we would like to know the reliable communication rates if a sender can use noiseless
resources in addition to a quantum channel to generate other noiseless resources. In prior work, we
have studied one such setting, where a sender and receiver generate or consume classical communication,
quantum communication, and entanglement along with the consumption of a noisy quantum channel
[16, 15, 17]. The result of these efforts was a characterization of the "dynamic capacity region" of a noisy
quantum channel.

One of the shortcomings of the characterization of the dynamic capacity region in Refs. [16, 15, 17] is
that its computation for a general quantum channel is intractable. Brádler et al. later demonstrated,
however, that the quantum Hadamard channels [18] are a natural class of channels for which the
computation of one octant of the region is tractable [5], because the structure of these channels appears to be
"just right" for this tractability to hold. The proof considers special quadrants of the dynamic capacity
region and employs several reductio ad absurdum arguments to characterize one octant of the full
region. The work of Brádler et al. showed that we can claim a complete understanding of the abilities
of an entanglement-assisted quantum Hadamard channel for the transmission of classical and quantum
information.

The aim of the present work is two-fold: 1) to simplify the proof of the converse part of the dynamic
capacity theorem in Ref. [17] and 2) to show that there is one important formula to consider for any
task involving noiseless classical communication, noiseless quantum communication, noiseless
entanglement, and many uses of a noisy quantum channel. Our previous proof of the converse in Ref. [17] relies
extensively on prior literature in quantum Shannon theory, perhaps making our ideas inaccessible to an
audience unfamiliar with this increasingly "tangled web." Here, we apply an "ab initio" approach to the
proof, using only four tools from quantum information theory: the Alicki-Fannes' inequality [1], the chain
rule for quantum mutual information, elementary properties of quantum entropy, and the quantum data
processing inequality [23]. As such, the proof here should be more accessible to a broader audience and
more streamlined because it is "disentangled" from the complex web that the quantum Shannon theory 
literature has become. 

We also propose a new formula that characterizes any task involving classical communication, quantum
communication, and entanglement in dynamic quantum Shannon theory. For this reason, we call
this formula the "quantum dynamic capacity formula." In particular, additivity of this formula implies a 
complete understanding of any task in dynamic quantum Shannon theory involving the three fundamen- 
tal noiseless resources. We find a simplified, direct proof that additivity holds for the Hadamard class 
of channels and for a quantum erasure channel — thus their full dynamic capacity regions are tractable. 
The additivity proof for the quantum erasure channel is different from that of the Hadamard channel — it 
exploits the particular structure of the quantum erasure channel. 

We structure this work as follows. The next section reviews the minimal tools from quantum information
theory necessary to understand the rest of the paper. Section 3 outlines the information processing
task considered in this paper, defines what it means for a rate triple to be achievable, and provides a
definition of the dynamic capacity region of a quantum channel. Section 4 states the dynamic capacity
theorem, and Section 5 contains a brief review of the proof of the achievability part of the theorem. The
main protocol for proving this part is the "classically-enhanced father protocol," whose detailed proof
we gave in Ref. [16]. Section 6 contains the converse proof, where we proceed with the minimal tools
stated above. We then show in Section 7 how the quantum dynamic capacity formula characterizes the
optimization task for computing the Pareto optimal trade-off surface for the dynamic capacity region.
Section 8 proves that the quantum dynamic capacity formula is additive for the Hadamard class of
channels, and in Section 9 we directly compute and plot the region for a qubit dephasing channel, which is a
channel that falls within the Hadamard class. Section 10 then shows that the quantum dynamic capacity
formula is additive for the quantum erasure channel, and we compute and plot the region for this channel also.
Finally, we conclude with a brief discussion.



2 Definitions and notation 

We first establish some definitions and notation that we employ throughout the paper and review a few
important properties of quantum entropy. Let $\Phi^{AB}$ denote the maximally entangled state shared between
two parties:

$$|\Phi\rangle^{AB} \equiv \frac{1}{\sqrt{D}} \sum_{i=1}^{D} |i\rangle^{A} |i\rangle^{B}.$$

An ebit corresponds to the special case where $D = 2$. Let $\overline{\Phi}^{M_A M_B}$ denote the maximally correlated state
shared between two parties:

$$\overline{\Phi}^{M_A M_B} \equiv \frac{1}{D} \sum_{i=1}^{D} |i\rangle\langle i|^{M_A} \otimes |i\rangle\langle i|^{M_B}.$$

A common randomness bit corresponds to the special case where $D = 2$.

Footnote: "Dynamic quantum Shannon theory" refers to the setting in which a sender and a receiver have access to many uses
of a quantum channel connecting them.

A completely-positive trace-preserving (CPTP) map $\mathcal{N}^{A'\to B}$ is the most general map we consider
that maps from a quantum system $A'$ to another quantum system $B$ [22]. It acts as follows on any density
operator $\rho$:

$$\mathcal{N}^{A'\to B}(\rho) = \sum_{k} A_k \rho A_k^{\dagger},$$

where the operators $A_k$ satisfy the condition $\sum_k A_k^{\dagger} A_k = I$. A quantum channel admits an isometric
extension $U_{\mathcal{N}}^{A'\to BE}$, which is a unitary embedding into a larger Hilbert space. One recovers the original
channel by taking a partial trace over the "environment" system $E$.

We consider a three-dimensional capacity region throughout this work (as in Ref. [17]), whose points
(C, Q, E) correspond to rates of classical communication, quantum communication, and entanglement
generation/consumption, respectively. For example, the teleportation protocol consumes two classical
bits and an ebit to generate a noiseless qubit [3]. Thus, we write it as the following rate triple:

(-2,1,-1),

where we indicate consumption of a resource with a negative sign and generation of a resource with a
positive sign. Also, the super-dense coding protocol consumes a noiseless qubit channel and an ebit to
generate two classical bits [6]. It corresponds to the rate triple:

(2,-1,-1). 

Another protocol that we exploit is entanglement distribution. It uses a noiseless qubit channel to estab- 
lish a noiseless ebit and corresponds to 

(0,-1,1). 

The entropy $H(A)_\rho$ of a density operator $\rho^{A}$ on some quantum system $A$ is as follows [22]:

$$H(A)_\rho \equiv -\mathrm{Tr}\{\rho^{A} \log \rho^{A}\},$$

where the logarithm is base two. The entropy can never exceed the logarithm of the dimension of $A$. The
quantum mutual information $I(A;B)_\rho$ of a bipartite state $\rho^{AB}$ is as follows:

$$I(A;B)_\rho \equiv H(A)_\rho + H(B)_\rho - H(AB)_\rho.$$

Observe that the quantum mutual information $I(M_A;M_B)$ of the state $\overline{\Phi}^{M_A M_B}$ is equal to $\log D$ bits,
and the quantum mutual information $I(A;B)$ of the state $\Phi^{AB}$ is equal to $2 \log D$ qubits. If one system is
classical, then the quantum mutual information can never be greater than the logarithm of the dimension
of the classical system. If both systems are quantum, then the quantum mutual information can never
be greater than twice the minimum of the logarithms of the dimensions of the two quantum systems.
The quantum mutual information vanishes if the bipartite state is a product state. The conditional
quantum mutual information for three quantum systems $A$, $B$, and $C$ is as follows:

$$I(A;B|C) \equiv H(AC) + H(BC) - H(C) - H(ABC),$$

and is always non-negative due to strong subadditivity [20]. The coherent information $I(A\rangle B)_\rho$ of a
state $\rho^{AB}$ is as follows [25]:

$$I(A\rangle B)_\rho \equiv H(B)_\rho - H(AB)_\rho.$$

Observe that the coherent information $I(A\rangle B)$ of the state $\Phi^{AB}$ is equal to $\log D$ qubits. The chain rule
for quantum mutual information gives the following relation for any three quantum systems $A$, $B$, and
$C$:

$$I(AB;C) = I(B;C) + I(A;C|B) = I(A;C) + I(B;C|A). \tag{1}$$
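The values quoted above for the maximally entangled and maximally correlated states can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names (`entropy`, `reduced_states`, `maximally_entangled`, and so on) are illustrative choices and do not come from the paper.

```python
import numpy as np

def entropy(rho):
    """Von Neumann entropy in bits; numerically zero eigenvalues are dropped."""
    evals = np.linalg.eigvalsh(rho)
    evals = evals[evals > 1e-12]
    return float(-np.sum(evals * np.log2(evals)))

def reduced_states(rho_ab, d):
    """Marginals (rho_A, rho_B) of a state on two d-dimensional systems."""
    t = rho_ab.reshape(d, d, d, d)        # indices: a, b, a', b'
    rho_a = np.einsum("ibjb->ij", t)      # partial trace over B
    rho_b = np.einsum("aiaj->ij", t)      # partial trace over A
    return rho_a, rho_b

def mutual_information(rho_ab, d):
    """I(A;B) = H(A) + H(B) - H(AB)."""
    rho_a, rho_b = reduced_states(rho_ab, d)
    return entropy(rho_a) + entropy(rho_b) - entropy(rho_ab)

def coherent_information(rho_ab, d):
    """I(A>B) = H(B) - H(AB)."""
    _, rho_b = reduced_states(rho_ab, d)
    return entropy(rho_b) - entropy(rho_ab)

def maximally_entangled(d):
    """Density operator of (1/sqrt(d)) sum_i |i>|i>."""
    psi = np.eye(d).reshape(d * d) / np.sqrt(d)
    return np.outer(psi, psi.conj())

def maximally_correlated(d):
    """Density operator (1/d) sum_i |i><i| (x) |i><i|."""
    rho = np.zeros((d * d, d * d))
    for i in range(d):
        rho[i * d + i, i * d + i] = 1.0 / d
    return rho

if __name__ == "__main__":
    for d in (2, 4):
        ent, cor = maximally_entangled(d), maximally_correlated(d)
        print(d, mutual_information(ent, d), coherent_information(ent, d),
              mutual_information(cor, d), coherent_information(cor, d))
```

For dimension $D$, the sketch should reproduce $2\log D$ and $\log D$ for the mutual and coherent information of the maximally entangled state, and $\log D$ for the mutual information of the maximally correlated state, up to floating-point error.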

A classical-quantum state $\sigma^{XABE}$ of the following form plays an important role throughout this
paper:

$$\sigma^{XABE} \equiv \sum_{x} p_X(x)\, |x\rangle\langle x|^{X} \otimes U_{\mathcal{N}}^{A'\to BE}(\phi_x^{AA'}),$$

where the states $\phi_x^{AA'}$ are pure bipartite states and $U_{\mathcal{N}}^{A'\to BE}$ is the isometric extension of some noisy
channel $\mathcal{N}^{A'\to B}$. Applying the above chain rule gives the following relation:

$$I(AX;B)_\sigma = I(X;B)_\sigma + I(A;B|X)_\sigma. \tag{2}$$

Additionally, one can readily check that the following relation holds for a state of the above form:

$$I(A\rangle BX)_\sigma = \tfrac{1}{2} I(A;B|X)_\sigma - \tfrac{1}{2} I(A;E|X)_\sigma. \tag{3}$$

The Alicki-Fannes' inequality is a statement of the continuity of coherent information [1], and a simple
variant of it gives continuity of quantum mutual information. First, suppose that two bipartite states
$\rho^{AB}$ and $\sigma^{AB}$ are $\epsilon$-close in trace norm:

$$\|\rho^{AB} - \sigma^{AB}\|_1 \leq \epsilon.$$

Then the Alicki-Fannes' inequality states that their respective coherent informations are close:

$$|I(A\rangle B)_\rho - I(A\rangle B)_\sigma| \leq 4\epsilon \log|A| + 2H_2(\epsilon),$$

where $|A|$ is the dimension of the system $A$ and $H_2$ is the binary entropy function. A simple "tweak" of
the above inequality shows that the quantum mutual informations are close [16]:

$$|I(A;B)_\rho - I(A;B)_\sigma| \leq 5\epsilon \log|A| + 3H_2(\epsilon).$$
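To see how these continuity bounds behave as the trace distance shrinks, the right-hand sides can be evaluated directly. The sketch below uses only the Python standard library; the function names are illustrative and not taken from the paper.

```python
import math

def binary_entropy(eps):
    """Binary entropy H_2(eps) in bits, with H_2(0) = H_2(1) = 0."""
    if eps <= 0.0 or eps >= 1.0:
        return 0.0
    return -eps * math.log2(eps) - (1.0 - eps) * math.log2(1.0 - eps)

def af_coherent_bound(eps, dim_a):
    """Right-hand side of the coherent-information bound: 4 eps log|A| + 2 H_2(eps)."""
    return 4.0 * eps * math.log2(dim_a) + 2.0 * binary_entropy(eps)

def af_mutual_bound(eps, dim_a):
    """Right-hand side of the mutual-information variant: 5 eps log|A| + 3 H_2(eps)."""
    return 5.0 * eps * math.log2(dim_a) + 3.0 * binary_entropy(eps)

if __name__ == "__main__":
    # Both bounds shrink to zero as eps -> 0 for a fixed dimension |A|.
    for eps in (1e-1, 1e-2, 1e-3, 1e-4):
        print(eps, af_coherent_bound(eps, 16), af_mutual_bound(eps, 16))
```

Because $\log|A|$ grows linearly in the blocklength $n$ for the registers used later, dividing either bound by $n$ leaves a quantity that vanishes as $\epsilon \to 0$.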

The quantum data processing inequality states that quantum correlations can never increase under
the application of a noisy map [23, 22]. There are two important manifestations of it. Suppose that Alice
possesses two quantum systems $A$ and $A'$ in her lab. She sends $A'$ through a noisy quantum channel
$\mathcal{N}^{A'\to B}$ to Bob and he receives it in some system $B$. Then the following inequality applies to the coherent
information:

$$I(A\rangle A') \geq I(A\rangle B),$$

demonstrating that quantum correlations, as measured by the coherent information, can only decrease
under noisy processing. The following inequality also applies to the quantum mutual information:

$$I(A;A') \geq I(A;B),$$

demonstrating a similar notion for the quantum mutual information. Now suppose that Alice sends $A$ to
Christabelle and she receives it in some system $C$. Then the following inequality applies as well:

$$I(A;A') \geq I(C;A') \geq I(C;B),$$

but a similar inequality does not hold for the coherent information, due to its asymmetry. We make
extensive use of quantum data processing in our proofs.
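As a small numerical illustration of the two data processing inequalities, one can send half of an ebit through a qubit dephasing channel and compare the correlations before and after. This is a sketch assuming NumPy; the dephasing channel, the parameter `p`, and all function names are choices made for this example rather than anything fixed by the text above.

```python
import numpy as np

def entropy(rho):
    """Von Neumann entropy in bits; numerically zero eigenvalues are dropped."""
    evals = np.linalg.eigvalsh(rho)
    evals = evals[evals > 1e-12]
    return float(-np.sum(evals * np.log2(evals)))

def marginals(rho_ab):
    """Marginals (rho_A, rho_B) of a two-qubit state."""
    t = rho_ab.reshape(2, 2, 2, 2)
    return np.einsum("ibjb->ij", t), np.einsum("aiaj->ij", t)

def mutual_information(rho_ab):
    rho_a, rho_b = marginals(rho_ab)
    return entropy(rho_a) + entropy(rho_b) - entropy(rho_ab)

def coherent_information(rho_ab):
    _, rho_b = marginals(rho_ab)
    return entropy(rho_b) - entropy(rho_ab)

def dephase_second_qubit(rho_ab, p):
    """Apply rho -> (1 - p) rho + p Z rho Z to the second qubit only."""
    z = np.kron(np.eye(2), np.diag([1.0, -1.0]))
    return (1.0 - p) * rho_ab + p * (z @ rho_ab @ z)

# Input state on A A': one ebit.
bell = np.array([1.0, 0.0, 0.0, 1.0]) / np.sqrt(2.0)
rho_in = np.outer(bell, bell)

if __name__ == "__main__":
    for p in (0.0, 0.1, 0.25, 0.5):
        rho_out = dephase_second_qubit(rho_in, p)   # state on A B
        print(p, mutual_information(rho_out), coherent_information(rho_out))
```

For every `p`, both quantities evaluated on the output should be no larger than their values on the input state, in line with the inequalities above.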



3 The information processing task 

We are interested in the most general protocol that generates classical communication, quantum
communication, and entanglement by consuming many uses of a noisy quantum channel $\mathcal{N}^{A'\to B}$ and the same
respective resources (see Figure 1). We say that such a protocol is "catalytic" because we are allowing
it to consume the same resources that it generates, though we are keeping track of the net rates of
consumption or generation.

The protocol begins with Alice possessing two classical registers (each labeled by $M$ and of dimension
$2^{n\bar{C}}$), a quantum register $A_1$ of dimension $2^{n\bar{Q}}$ entangled with a reference system $R$, and another quantum
register $T_A$ of dimension $2^{n\tilde{E}}$ that contains her half of the shared entanglement with Bob:

$$\omega^{MMRA_1T_AT_B} \equiv \overline{\Phi}^{MM} \otimes \Phi^{RA_1} \otimes \Phi^{T_AT_B}.$$

She passes one of the classical registers and the registers $A_1$ and $T_A$ into a CPTP encoding map
$\mathcal{E}^{MA_1T_A\to A'^nS_ALA_2}$ that outputs a quantum register $S_A$ of dimension $2^{n\bar{E}}$ and a quantum register $A_2$ of
dimension $2^{n\tilde{Q}}$, a classical register $L$ of dimension $2^{n\tilde{C}}$, and many quantum systems $A'^n$ for input to the
channel. The register $S_A$ is for creating entanglement with Bob. The state after the encoding map $\mathcal{E}$ is
as follows:

$$\omega^{MA'^nS_ALA_2RT_B} \equiv \mathcal{E}^{MA_1T_A\to A'^nS_ALA_2}(\omega^{MMRA_1T_AT_B}).$$

Fig. 1 (Color online) The most general protocol for generating classical communication, quantum communication, and
entanglement with the help of the same respective resources and many uses of a noisy quantum channel. Alice begins with
her classical register $M$, her quantum register $A_1$, and her half of the shared entanglement in register $T_A$. She encodes
according to some CPTP map $\mathcal{E}$ that outputs a quantum register $S_A$, many registers $A'^n$, a quantum register $A_2$, and
a classical register $L$. She inputs $A'^n$ to many uses of the noisy channel $\mathcal{N}$ and transmits $A_2$ over a noiseless quantum
channel and $L$ over a noiseless classical channel. Bob receives the channel outputs $B^n$, the quantum register $A_2$, and the
classical register $L$ and performs a decoding $\mathcal{D}$ that recovers the quantum information and classical message. The decoding
also generates entanglement with system $S_A$. Many protocols are a special case of the above one. For example, the protocol
is entanglement-assisted communication of classical and quantum information [16] if the registers $L$, $S_A$, $S_B$, and $A_2$ are
null.

She sends the systems $A'^n$ through many uses of the noisy channel $\mathcal{N}^{A'\to B}$, transmits $L$ over
a noiseless classical channel, and transmits $A_2$ over a noiseless quantum channel, producing the following
state:

$$\omega^{MB^nS_ALA_2RT_B} \equiv \mathcal{N}^{A'^n\to B^n}(\omega^{MA'^nS_ALA_2RT_B}).$$

The above state is a state of the form in (15) with $A \equiv RT_BA_2S_A$ and $X \equiv ML$. Bob then applies a map
$\mathcal{D}^{B^nA_2T_BL\to B_1S_B\hat{M}}$ that outputs a quantum system $B_1$, a quantum system $S_B$, and a classical register
$\hat{M}$. Let $\omega'$ denote the final state. The following condition holds for a good protocol:

$$\left\| \overline{\Phi}^{M\hat{M}} \otimes \Phi^{RB_1} \otimes \Phi^{S_AS_B} - (\omega')^{MB_1S_B\hat{M}S_AR} \right\|_1 \leq \epsilon, \tag{4}$$

implying that Alice and Bob establish maximal classical correlations in $M$ and $\hat{M}$ and maximal
entanglement between $S_A$ and $S_B$. The above condition also implies that the coding scheme preserves the
entanglement with the reference system $R$. The net rate triple for the protocol is as follows:
$$(\bar{C} - \tilde{C},\; \bar{Q} - \tilde{Q},\; \bar{E} - \tilde{E}).$$

The protocol generates a resource if its corresponding rate is positive, and it consumes a resource if its
corresponding rate is negative. Such a protocol defines an $(n, C, Q, E, \epsilon)$ code with

$$C \equiv \bar{C} - \tilde{C}, \tag{5}$$
$$Q \equiv \bar{Q} - \tilde{Q}, \tag{6}$$
$$E \equiv \bar{E} - \tilde{E}. \tag{7}$$

Definition 1 (Achievability) A rate triple $(C, Q, E)$ is achievable if there exists an $(n, C, Q, E, \epsilon)$ code
with error (as defined in (4)) smaller than $\epsilon$ for all $\epsilon > 0$ and sufficiently large $n$.

Definition 2 (Dynamic Capacity Region) The dynamic capacity region $\mathcal{C}_{\mathrm{CQE}}(\mathcal{N})$ of a noisy quantum
channel $\mathcal{N}^{A'\to B}$ is a three-dimensional region in the $(C, Q, E)$ space defined by the closure of the
set of all achievable rate triples $(C, Q, E)$.



4 The dynamic capacity theorem 

The dynamic capacity theorem gives bounds on the reliable communication rates of a noisy quantum
channel when combined with the noiseless resources of classical communication, quantum communication,
and shared entanglement [17]. The theorem applies regardless of whether a protocol consumes the
noiseless resources or generates them.

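Theorem 1 (Dynamic Capacity) The dynamic capacity region $\mathcal{C}_{\mathrm{CQE}}(\mathcal{N})$ of a quantum channel $\mathcal{N}$ is
equal to the following expression:

$$\mathcal{C}_{\mathrm{CQE}}(\mathcal{N}) = \overline{\bigcup_{k=1}^{\infty} \frac{1}{k}\, \mathcal{C}^{(1)}_{\mathrm{CQE}}(\mathcal{N}^{\otimes k})}, \tag{8}$$

where the overbar indicates the closure of a set. The "one-shot" region $\mathcal{C}^{(1)}_{\mathrm{CQE}}(\mathcal{N})$ is the union of the
"one-shot, one-state" regions $\mathcal{C}^{(1)}_{\mathrm{CQE},\sigma}(\mathcal{N})$:

$$\mathcal{C}^{(1)}_{\mathrm{CQE}}(\mathcal{N}) \equiv \bigcup_{\sigma} \mathcal{C}^{(1)}_{\mathrm{CQE},\sigma}(\mathcal{N}).$$

The "one-shot, one-state" region $\mathcal{C}^{(1)}_{\mathrm{CQE},\sigma}(\mathcal{N})$ is the set of all rates $C$, $Q$, and $E$, such that

$$C + 2Q \leq I(AX;B)_\sigma, \tag{9}$$
$$Q + E \leq I(A\rangle BX)_\sigma, \tag{10}$$
$$C + Q + E \leq I(X;B)_\sigma + I(A\rangle BX)_\sigma. \tag{11}$$

The above entropic quantities are with respect to a classical-quantum state $\sigma^{XAB}$ where

$$\sigma^{XAB} \equiv \sum_{x} p(x)\, |x\rangle\langle x|^{X} \otimes \mathcal{N}^{A'\to B}(\phi_x^{AA'}), \tag{12}$$

and the states $\phi_x^{AA'}$ are pure. It is implicit that one should consider states on $A'^k$ instead of $A'$ when
taking the regularization in (8).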

The above theorem is a "multi-letter" capacity theorem because of the regularization in (8). However,
we show in Sections 8 and 10 that the regularization is not necessary for the Hadamard class of channels
or the quantum erasure channels, respectively. We prove the above theorem in two parts:

1. The direct coding theorem below shows that combining the "classically-enhanced father protocol"
with teleportation, super-dense coding, and entanglement distribution achieves the above region.

2. The converse theorem demonstrates that no coding scheme can do better than the regularization
in (8), in the sense that a scheme with vanishing error should have its rates below the above
amounts. We prove the converse theorem directly in "one fell swoop," by employing a catalytic,
information-theoretic approach. The converse proof is different from our earlier one [17] because we
employ straightforward information-theoretic arguments instead of making contact with prior quantum
Shannon theoretic literature.
5 Dynamic achievable rate region

The unit resource achievable region is what Alice and Bob can achieve with the protocols entanglement
distribution, teleportation, and super-dense coding [17]. It is the cone of the rate triples corresponding
to these protocols:

$$\{\alpha\,(0,-1,1) + \beta\,(2,-1,-1) + \gamma\,(-2,1,-1) : \alpha, \beta, \gamma \geq 0\}.$$

We can also write any rate triple $(C, Q, E)$ in the unit resource capacity region with a matrix equation:

$$\begin{bmatrix} C \\ Q \\ E \end{bmatrix} = \begin{bmatrix} 0 & 2 & -2 \\ -1 & -1 & 1 \\ 1 & -1 & -1 \end{bmatrix} \begin{bmatrix} \alpha \\ \beta \\ \gamma \end{bmatrix}. \tag{13}$$

The inverse of the above matrix is as follows:

$$\begin{bmatrix} -\tfrac{1}{2} & -1 & 0 \\ 0 & -\tfrac{1}{2} & -\tfrac{1}{2} \\ -\tfrac{1}{2} & -\tfrac{1}{2} & -\tfrac{1}{2} \end{bmatrix},$$

and gives the following set of inequalities for the unit resource achievable region:

$$C + 2Q \leq 0,$$
$$Q + E \leq 0,$$
$$C + Q + E \leq 0,$$

by inverting the matrix equation in (13) and applying the constraints $\alpha, \beta, \gamma \geq 0$.
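The inversion step can be checked with exact rational arithmetic. The sketch below uses only the Python standard library; the names `M`, `M_INV`, `protocol_weights`, and `in_unit_region` are illustrative labels introduced here, with the columns of `M` ordered as in (13): entanglement distribution, super-dense coding, teleportation.

```python
from fractions import Fraction as F

# Matrix from (13): columns are entanglement distribution (0,-1,1),
# super-dense coding (2,-1,-1), and teleportation (-2,1,-1).
M = [[0, 2, -2],
     [-1, -1, 1],
     [1, -1, -1]]

# Claimed inverse of M.
M_INV = [[F(-1, 2), F(-1), F(0)],
         [F(0), F(-1, 2), F(-1, 2)],
         [F(-1, 2), F(-1, 2), F(-1, 2)]]

def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def protocol_weights(c, q, e):
    """Solve (13) for (alpha, beta, gamma) given a rate triple (C, Q, E)."""
    rates = (c, q, e)
    return tuple(sum(M_INV[i][j] * rates[j] for j in range(3)) for i in range(3))

def in_unit_region(c, q, e):
    """True when all three weights are non-negative, i.e. the triple is in the cone."""
    return all(w >= 0 for w in protocol_weights(c, q, e))

if __name__ == "__main__":
    print(matmul(M_INV, M))              # should be the identity matrix
    print(protocol_weights(-2, 1, -1))   # teleportation alone
    print(in_unit_region(1, 0, 0))       # free classical communication is not allowed
```

Reading off the rows of `M_INV`, the conditions $\alpha, \beta, \gamma \geq 0$ become $C + 2Q \leq 0$, $Q + E \leq 0$, and $C + Q + E \leq 0$, which is exactly the set of inequalities stated above.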

Now, let us include the classically-enhanced father protocol. Ref. [16] proved that we can achieve
the following rate triple by channel coding over a noisy quantum channel $\mathcal{N}^{A'\to B}$:

$$\left( I(X;B)_\sigma,\; \tfrac{1}{2} I(A;B|X)_\sigma,\; -\tfrac{1}{2} I(A;E|X)_\sigma \right),$$

for any state $\sigma^{XABE}$ of the form:

$$\sigma^{XABE} \equiv \sum_{x} p_X(x)\, |x\rangle\langle x|^{X} \otimes U_{\mathcal{N}}^{A'\to BE}(\phi_x^{AA'}), \tag{14}$$

where $U_{\mathcal{N}}^{A'\to BE}$ is an isometric extension of the quantum channel $\mathcal{N}^{A'\to B}$. Specifically, we showed in
Ref. [16] that one can achieve the above rates with vanishing error in the limit of large blocklength. Thus
the achievable rate region is the following translation of the unit resource achievable region in (13):

$$\begin{bmatrix} C \\ Q \\ E \end{bmatrix} = \begin{bmatrix} 0 & 2 & -2 \\ -1 & -1 & 1 \\ 1 & -1 & -1 \end{bmatrix} \begin{bmatrix} \alpha \\ \beta \\ \gamma \end{bmatrix} + \begin{bmatrix} I(X;B)_\sigma \\ \tfrac{1}{2} I(A;B|X)_\sigma \\ -\tfrac{1}{2} I(A;E|X)_\sigma \end{bmatrix}.$$

We can now determine bounds on an achievable rate region that employs the above coding strategy. We
apply the inverse of the matrix in (13) to the LHS and RHS. Then using (2), (3), and the constraints
$\alpha, \beta, \gamma \geq 0$, we obtain the inequalities in (9)-(11), corresponding exactly to the one-shot, one-state region
in Theorem 1. Taking the union over all possible states $\sigma$ in (14) and taking the regularization gives the
full dynamic achievable rate region.
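The resulting membership test for the one-shot, one-state region is simple to state in code. The sketch below is illustrative only: the function names are introduced here, and the numerical values of the entropic quantities are hypothetical inputs chosen for the example, not values computed for any particular channel.

```python
def father_point(i_x_b, i_ab_x, i_ae_x):
    """Rate triple of the classically-enhanced father protocol:
    (I(X;B), I(A;B|X)/2, -I(A;E|X)/2)."""
    return (i_x_b, 0.5 * i_ab_x, -0.5 * i_ae_x)

def in_one_shot_one_state_region(c, q, e, i_x_b, i_ab_x, i_ae_x, tol=1e-9):
    """Check the inequalities (9)-(11) for a rate triple (C, Q, E)."""
    i_ax_b = i_x_b + i_ab_x               # relation (2)
    i_a_bx = 0.5 * i_ab_x - 0.5 * i_ae_x  # relation (3)
    return (c + 2 * q <= i_ax_b + tol
            and q + e <= i_a_bx + tol
            and c + q + e <= i_x_b + i_a_bx + tol)

if __name__ == "__main__":
    # Hypothetical entropic quantities, in bits, for some state sigma.
    quantities = dict(i_x_b=0.5, i_ab_x=1.0, i_ae_x=0.5)
    c, q, e = father_point(**quantities)
    print((c, q, e), in_one_shot_one_state_region(c, q, e, **quantities))
    # Father protocol followed by one unit of teleportation (-2, 1, -1).
    print(in_one_shot_one_state_region(c - 2, q + 1, e - 1, **quantities))
    # Asking for the same quantum rate without consuming entanglement fails (10).
    print(in_one_shot_one_state_region(c, q, 0.0, **quantities))
```

The father point itself saturates all three inequalities, and adding non-negative multiples of the unit protocols keeps a triple inside the region, which mirrors the translation-of-a-cone picture described above.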
6 Catalytic and information theoretic converse proof

This section begins one of the main contributions of this work. We provide a catalytic, information
theoretic converse proof of the dynamic capacity region, showing that (8) gives a multi-letter characterization
of it. The catalytic approach means that we are considering the most general protocol that consumes
and generates classical communication, quantum communication, and entanglement in addition to the
uses of the noisy quantum channel. This approach has the advantage that we can prove the converse
theorem in "one fell swoop" rather than considering one octant of the $(C, Q, E)$ space at a time as we did
in Ref. [17]. Additionally, we do not need to make contact with prior work in quantum Shannon theory.
We employ the Alicki-Fannes' inequality, the chain rule for quantum mutual information, elementary
properties of quantum entropy, and the quantum data processing inequality to prove the converse.

There are some subtleties in our proof for the converse theorem. We prove that the bounds in (9)-(11)
hold for common randomness generation instead of classical communication because a capacity for
generating common randomness can only be better than that for generating classical communication
(classical communication can generate common randomness). We also consider a protocol that preserves
entanglement with a reference system instead of one that generates quantum communication.
Barnum et al. showed that this task is equivalent to the transmission of quantum information [2].
We prove that the converse theorem holds for a state of the following form

$$\sigma^{XAB} \equiv \sum_{x} p(x)\, |x\rangle\langle x|^{X} \otimes \mathcal{N}^{A'\to B}(\rho_x^{AA'}), \tag{15}$$

where the states $\rho_x^{AA'}$ are mixed, rather than proving it for a state of the form in (14). Then we show in
Section 6.1 that it is not necessary to consider an ensemble of mixed states; that is, we can do just as well
with an ensemble of pure states, giving the statement of Theorem 1.

We begin by proving the first bound in (9). All system labels are as given in Section 3 and we consider
the most general protocol as outlined in that section. Consider the following chain of inequalities:

$$\begin{aligned}
n(\bar{C} + 2\bar{Q}) &= I(M;\hat{M})_{\overline{\Phi}} + I(R;B_1)_{\Phi} \\
&\leq I(M;\hat{M})_{\omega'} + I(R;B_1)_{\omega'} + n\delta' \\
&\leq I(M;B^nA_2LT_B)_\omega + I(R;B^nA_2LT_B)_\omega \\
&\leq I(M;B^nA_2LT_B)_\omega + I(R;B^nA_2LT_BM)_\omega \\
&= I(M;B^nA_2LT_B)_\omega + I(R;B^nA_2LT_B|M)_\omega + I(R;M)_\omega.
\end{aligned}$$

The first equality holds by evaluating the quantum mutual informations on the respective states $\overline{\Phi}^{M\hat{M}}$
and $\Phi^{RB_1}$. The first inequality follows from the condition in (4) and an application of the Alicki-Fannes'
inequality where $\delta'$ vanishes as $\epsilon \to 0$. We suppress this term in the rest of the inequalities for
convenience. The second inequality follows from quantum data processing, and the third follows from another
application of quantum data processing. The second equality follows by applying the mutual information
chain rule in (1). We continue below:

$$\begin{aligned}
&= I(M;B^nA_2LT_B)_\omega + I(R;B^nA_2LT_B|M)_\omega \\
&= I(M;B^nA_2LT_B)_\omega + I(RA_2LT_B;B^n|M)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega \\
&= I(M;B^n)_\omega + I(M;A_2LT_B|B^n)_\omega + I(RA_2LT_B;B^n|M)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega \\
&= I(RA_2LMT_B;B^n)_\omega + I(M;A_2LT_B|B^n)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega \\
&\leq I(RA_2T_BS_ALM;B^n)_\omega + I(M;A_2LT_B|B^n)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega \\
&= I(AX;B^n)_\omega + I(M;A_2LT_B|B^n)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega.
\end{aligned}$$

The first equality follows because $I(R;M)_\omega = 0$ for this protocol. The second equality follows from
applying the chain rule for quantum mutual information to the term $I(R;B^nA_2LT_B|M)_\omega$ and the third
is another application of the chain rule to the term $I(M;B^nA_2LT_B)_\omega$. The fourth equality follows by
combining $I(M;B^n)_\omega$ and $I(RA_2LT_B;B^n|M)_\omega$ with the chain rule. The inequality follows from an
application of quantum data processing. The final equality follows from the definitions $A \equiv RT_BA_2S_A$
and $X \equiv ML$. We now focus on the term $I(M;A_2LT_B|B^n)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega$
and show that it is less than $n(\tilde{C} + 2\tilde{Q})$:

$$\begin{aligned}
& I(M;A_2LT_B|B^n)_\omega + I(R;A_2LT_B|M)_\omega - I(B^n;A_2LT_B|M)_\omega \\
&= I(M;A_2LT_BB^n)_\omega + I(R;A_2LT_BM)_\omega - I(B^n;A_2LT_BM)_\omega - I(R;M)_\omega \\
&= I(M;A_2LT_BB^n)_\omega + I(R;A_2LT_BM)_\omega - I(B^n;A_2LT_BM)_\omega \\
&= H(A_2LT_BB^n)_\omega + H(R)_\omega - H(RA_2LT_B|M)_\omega - H(B^n)_\omega \\
&= H(A_2LT_BB^n)_\omega - H(A_2LT_B|MR)_\omega - H(B^n)_\omega \\
&= H(A_2LT_B|B^n)_\omega - H(A_2LT_B|MR)_\omega.
\end{aligned}$$

The first equality follows by applying the chain rule for quantum mutual information. The second equality
follows because $I(R;M)_\omega = 0$ for this protocol. The third equality follows by expanding the quantum
mutual informations. The next two equalities follow from straightforward entropic manipulations and
that $H(R)_\omega = H(R|M)_\omega$ for this protocol. We continue below:

$$\begin{aligned}
&= H(A_2L|B^n)_\omega + H(T_B|B^nA_2L)_\omega - H(T_B|MR)_\omega - H(A_2L|T_BMR)_\omega \\
&= H(A_2L|B^n)_\omega + H(T_B|B^nA_2L)_\omega - H(T_B)_\omega - H(A_2L|T_BMR)_\omega \\
&= H(A_2L|B^n)_\omega - I(T_B;B^nA_2L)_\omega - H(A_2L|T_BMR)_\omega \\
&\leq H(A_2L)_\omega - H(A_2L|T_BMR)_\omega \\
&= I(A_2L;T_BMR)_\omega \\
&= I(L;T_BMR)_\omega + I(A_2;T_BMR|L)_\omega \\
&\leq n(\tilde{C} + 2\tilde{Q}).
\end{aligned}$$

The first two equalities follow from the chain rule for entropy and the second exploits that $H(T_B|MR)_\omega =
H(T_B)_\omega$ for this protocol. The third equality follows from the definition of quantum mutual information.
The inequality follows from subadditivity of entropy and that $I(T_B;B^nA_2L)_\omega \geq 0$. The fourth equality
follows from the definition of quantum mutual information and the next equality follows from the chain
rule. The final inequality follows because the quantum mutual information $I(L;T_BMR)_\omega$ can never be
larger than the logarithm of the dimension of the classical register $L$ and because the quantum mutual
information $I(A_2;T_BMR|L)_\omega$ can never be larger than twice the logarithm of the dimension of the
quantum register $A_2$. Thus the following inequality applies

$$n(\bar{C} + 2\bar{Q}) \leq I(AX;B^n)_\omega + n(\tilde{C} + 2\tilde{Q}) + n\delta',$$

demonstrating that (9) holds for the net rates.



We now prove the second bound in (10). Consider the following chain of inequalities:

$$\begin{aligned}
n(\bar{Q} + \bar{E}) &= I(R\rangle B_1)_{\Phi} + I(S_A\rangle S_B)_{\Phi} \\
&= I(RS_A\rangle B_1S_B)_{\Phi\otimes\Phi} \\
&\leq I(RS_A\rangle B_1S_B)_{\omega'} + n\delta' \\
&\leq I(RS_A\rangle B_1S_BM)_{\omega'} \\
&\leq I(RS_A\rangle B^nA_2T_BLM)_\omega \\
&= H(B^nA_2T_B|LM)_\omega - H(RS_AB^nA_2T_B|LM)_\omega \\
&\leq H(B^n|LM)_\omega + H(A_2|LM)_\omega + H(T_B|LM)_\omega - H(RS_AB^nA_2T_B|LM)_\omega \\
&\leq I(RS_AA_2T_B\rangle B^nLM)_\omega + n(\tilde{Q} + \tilde{E}) \\
&= I(A\rangle B^nX)_\omega + n(\tilde{Q} + \tilde{E}).
\end{aligned}$$

The first equality follows by evaluating the coherent informations of the respective states $\Phi^{RB_1}$ and
$\Phi^{S_AS_B}$. The second equality follows because $\Phi^{RB_1} \otimes \Phi^{S_AS_B}$ is a product state. The first inequality
follows from the condition in (4) and an application of the Alicki-Fannes' inequality with $\delta'$ vanishing
when $\epsilon \to 0$. We suppress the term $n\delta'$ in the following lines. The next two inequalities follow from
quantum data processing. The third equality follows from the definition of coherent information. The
fourth inequality follows from subadditivity of entropy. The fifth inequality follows from the definition
of coherent information and the fact that the entropy can never be larger than the logarithm of the
dimension of the corresponding system. The final equality follows from the definitions $A \equiv RT_BA_2S_A$
and $X \equiv ML$. Thus the following inequality applies

$$n(\bar{Q} + \bar{E}) \leq I(A\rangle B^nX)_\omega + n(\tilde{Q} + \tilde{E}) + n\delta',$$

demonstrating that (10) holds for the net rates.



We prove the last bound in (11). Consider the following chain of inequalities:

$$\begin{aligned}
n(\bar{C} + \bar{Q} + \bar{E}) &= I(M;\hat{M})_{\overline{\Phi}} + I(RS_A\rangle B_1S_B)_{\Phi\otimes\Phi} \\
&\leq I(M;\hat{M})_{\omega'} + I(RS_A\rangle B_1S_B)_{\omega'} + n\delta' \\
&\leq I(M;B^nA_2T_BL)_\omega + I(RS_A\rangle B^nA_2T_BLM)_\omega \\
&= I(ML;B^nA_2T_B)_\omega + I(M;L)_\omega - I(A_2B^nT_B;L)_\omega \\
&\quad + H(B^n|LM)_\omega + H(A_2T_B|B^nLM)_\omega - H(RS_AA_2T_BB^n|LM)_\omega \\
&= I(ML;B^n)_\omega + I(ML;A_2T_B|B^n)_\omega + I(M;L)_\omega - I(A_2B^nT_B;L)_\omega \\
&\quad + H(A_2T_B|B^nLM)_\omega + I(RS_AA_2T_B\rangle B^nLM)_\omega.
\end{aligned}$$

The first equality follows from evaluating the mutual information of the state $\overline{\Phi}^{M\hat{M}}$ and the coherent
information of the product state $\Phi^{RB_1} \otimes \Phi^{S_AS_B}$. The first inequality follows from the condition in (4)
and an application of the Alicki-Fannes' inequality with $\delta'$ vanishing when $\epsilon \to 0$. We suppress the
term $n\delta'$ in the following lines. The second inequality follows from quantum data processing. The second
equality follows from applying the chain rule for quantum mutual information to $I(M;B^nA_2T_BL)_\omega$ and
by expanding the coherent information $I(RS_A\rangle B^nA_2T_BLM)_\omega$. The third equality follows from applying
the chain rule for quantum mutual information to $I(ML;B^nA_2T_B)_\omega$ and from the definition of coherent
information. We continue below:

$$\begin{aligned}
&= I(ML;B^n)_\omega + I(RS_AA_2T_B\rangle B^nLM)_\omega \\
&\quad + I(ML;A_2T_B|B^n)_\omega + I(M;L)_\omega - I(A_2B^nT_B;L)_\omega + H(A_2T_B|B^nLM)_\omega \\
&= I(ML;B^n)_\omega + I(RS_AA_2T_B\rangle B^nLM)_\omega \\
&\quad + H(A_2T_B|B^n)_\omega + I(M;L)_\omega - I(A_2B^nT_B;L)_\omega \\
&\leq I(ML;B^n)_\omega + I(RS_AA_2T_B\rangle B^nLM)_\omega + n(\tilde{C} + \tilde{Q} + \tilde{E}) \\
&= I(X;B^n)_\omega + I(A\rangle B^nX)_\omega + n(\tilde{C} + \tilde{Q} + \tilde{E}).
\end{aligned}$$

The first equality follows by rearranging terms. The second equality follows by canceling terms. The
inequality follows from subadditivity of the entropy $H(A_2T_B|B^n)_\omega$, the fact that the entropy $H(A_2T_B)_\omega$
can never be larger than the logarithm of the dimension of the systems $A_2T_B$, that the mutual
information $I(M;L)_\omega$ can never be larger than the logarithm of the dimension of the classical register $L$,
and because $I(A_2B^nT_B;L)_\omega \geq 0$. The last equality follows from the definitions $A \equiv RT_BA_2S_A$ and
$X \equiv ML$. Thus the following inequality holds

$$n(\bar{C} + \bar{Q} + \bar{E}) \leq I(X;B^n)_\omega + I(A\rangle B^nX)_\omega + n(\tilde{C} + \tilde{Q} + \tilde{E}) + n\delta',$$

demonstrating that the inequality in (11) applies to the net rates. This concludes the catalytic proof of
the converse theorem.



6.1 Pure state ensembles are sufficient 

We prove that it is sufficient to consider an ensemble of pure states as in the statement of Theorem 1 rather
than an ensemble of mixed states as in (15) in the proof of our converse theorem. Our argument relies on
a classic trick exploited in quantum Shannon theory [11]. We first determine a spectral decomposition of
the mixed state ensemble, model the index of the pure states in the decomposition as a classical variable
$Y$, and then place this classical variable $Y$ in a classical register. It follows that the communication rates
can only improve, and it is sufficient to consider an ensemble of pure states.



Consider that each mixed state $\rho_x^{AA'}$ in the ensemble in (15) admits a spectral decomposition of the
following form:

$$\rho_x^{AA'} = \sum_{y} p(y|x)\, \psi_{x,y}^{AA'}.$$

We can thus represent the ensemble as follows:

$$\rho^{XAB} \equiv \sum_{x,y} p(x)\, p(y|x)\, |x\rangle\langle x|^{X} \otimes \mathcal{N}^{A'\to B}(\psi_{x,y}^{AA'}). \tag{16}$$

The inequalities in (9)-(11) for the dynamic capacity region involve the mutual information $I(AX;B)_\rho$,
the Holevo information $I(X;B)_\rho$, and the coherent information $I(A\rangle BX)_\rho$. As we show below, each of
these entropic quantities can only improve if we make the variable $Y$ be part of the classical
variable. This improvement then implies that it is only necessary to consider pure states in the dynamic
capacity theorem.

Let $\theta^{XYAB}$ denote an augmented state of the following form:

\[ \theta^{XYAB} \equiv \sum_{x,y} p(x) p(y|x) \, |x\rangle\langle x|^X \otimes |y\rangle\langle y|^Y \otimes \mathcal{N}^{A' \to B}(\psi_{x,y}^{AA'}). \tag{17} \]

This state is actually a state of the form in (12) if we subsume the classical variables X and Y into one
classical variable. The following three inequalities each follow from an application of the quantum data
processing inequality:

\begin{align}
I(X;B)_\rho &= I(X;B)_\theta \le I(XY;B)_\theta, \tag{18}\\
I(AX;B)_\rho &= I(AX;B)_\theta \le I(AXY;B)_\theta, \tag{19}\\
I(A\rangle BX)_\rho &= I(A\rangle BX)_\theta \le I(A\rangle BXY)_\theta. \tag{20}
\end{align}



Each of these inequalities proves the desired result for the respective Holevo information, mutual information,
and coherent information, and it suffices to consider an ensemble of pure states in Theorem 1.



7 The quantum dynamic capacity formula 

We introduce the quantum dynamic capacity formula and show how additivity of it implies that the 
computation of the Pareto optimal trade-off surface of the capacity region is tractable. The Pareto 
optimal trade-off surface consists of all points in the capacity region that are Pareto optimal, in the sense 
that it is not possible to make improvements in one resource without offsetting another resource (these 
are essentially the boundary points of the region in our case). We then show how several important 
capacity formulas in the quantum Shannon theory literature are special cases of the quantum dynamic 
capacity formula. 

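Definition 3 (Quantum Dynamic Capacity Formula) The quantum dynamic capacity formula of
a quantum channel $\mathcal{N}$ is as follows:

\[ D_{\lambda,\mu}(\mathcal{N}) \equiv \max_{\sigma} \; I(AX;B)_\sigma + \lambda I(A\rangle BX)_\sigma + \mu \left( I(X;B)_\sigma + I(A\rangle BX)_\sigma \right), \tag{21} \]

where the maximization is over states $\sigma$ of the form in (14) and $\lambda, \mu \ge 0$.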

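Definition 4 The regularized quantum dynamic capacity formula is as follows:

\[ D^{\mathrm{reg}}_{\lambda,\mu}(\mathcal{N}) \equiv \lim_{n \to \infty} \frac{1}{n} D_{\lambda,\mu}(\mathcal{N}^{\otimes n}). \]

Lemma 1 Suppose the quantum dynamic capacity formula is additive for any two channels $\mathcal{N}$ and $\mathcal{M}$:

\[ D_{\lambda,\mu}(\mathcal{N} \otimes \mathcal{M}) = D_{\lambda,\mu}(\mathcal{N}) + D_{\lambda,\mu}(\mathcal{M}). \]

Then the regularized quantum dynamic capacity formula for $\mathcal{N}$ is equal to the quantum dynamic capacity
formula for $\mathcal{N}$:

\[ D^{\mathrm{reg}}_{\lambda,\mu}(\mathcal{N}) = D_{\lambda,\mu}(\mathcal{N}). \]

In this sense, the regularized formula "single-letterizes" and it is not necessary to take the limit.

Proof We prove the result using induction on n. The base case for n = 1 is trivial. Suppose the result
holds for n: $D_{\lambda,\mu}(\mathcal{N}^{\otimes n}) = n D_{\lambda,\mu}(\mathcal{N})$. Then the following chain of equalities proves the inductive step:

\begin{align}
D_{\lambda,\mu}(\mathcal{N}^{\otimes (n+1)}) &= D_{\lambda,\mu}(\mathcal{N} \otimes \mathcal{N}^{\otimes n}) \\
&= D_{\lambda,\mu}(\mathcal{N}) + D_{\lambda,\mu}(\mathcal{N}^{\otimes n}) \\
&= D_{\lambda,\mu}(\mathcal{N}) + n D_{\lambda,\mu}(\mathcal{N}).
\end{align}

The first equality follows by expanding the tensor product. The second critical equality follows from the
assumption that the formula is additive. The final equality follows from the induction hypothesis.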

Theorem 2 Single-letterization of the quantum dynamic capacity formula implies that the computation 
of the Pareto optimal trade-off surface of the dynamic capacity region is a tractable convex optimization 
program. 

Proof We employ ideas from Ref. [7] for the proof. First, we show that the capacity region defined in
Theorem 1 is convex. The proof is similar to previous ones in Refs. [12, 27]. Consider any two states
$\rho^{XAB^n}$ and $\sigma^{XAB^n}$ of the form in (14) which arise from the tensor power channel $\mathcal{N}^{\otimes n}$. Consider the
following state $\omega^{UXAB^n}$, which is a mixture of the states $\rho^{XAB^n}$ and $\sigma^{XAB^n}$ and is also of the form in
(14):

\[ \omega^{UXAB^n} \equiv \lambda \, |0\rangle\langle 0|^U \otimes \rho^{XAB^n} + (1 - \lambda) \, |1\rangle\langle 1|^U \otimes \sigma^{XAB^n}, \]

where $0 \le \lambda \le 1$. Then the following inequalities hold:

\begin{align}
\lambda I(AX;B^n)_\rho + (1-\lambda) I(AX;B^n)_\sigma &\le I(AUX;B^n)_\omega, \\
\lambda I(A\rangle B^n X)_\rho + (1-\lambda) I(A\rangle B^n X)_\sigma &= I(A\rangle B^n UX)_\omega, \\
\lambda I(X;B^n)_\rho + (1-\lambda) I(X;B^n)_\sigma &\le I(UX;B^n)_\omega,
\end{align}

because $I(AUX;B^n)_\omega = I(AX;B^n|U)_\omega + I(U;B^n)_\omega$ and $I(UX;B^n)_\omega = I(X;B^n|U)_\omega + I(U;B^n)_\omega$.
These inequalities demonstrate that the region is convex.

We would like to characterize all the points in the capacity region that are Pareto optimal. Such a
task is standard vector optimization in the theory of Pareto trade-off analysis (see Section 4.7 of Ref. [7]).
We can phrase the optimization task as the following scalarization of the vector optimization task:

\[ \max_{C, Q, E, p(x), \phi_x^{AA'^n}} \; w_C C + w_Q Q + w_E E \tag{22} \]

subject to

\begin{align}
C + 2Q &\le I(AX;B^n)_\sigma, \tag{23}\\
Q + E &\le I(A\rangle B^n X)_\sigma, \tag{24}\\
C + Q + E &\le I(X;B^n)_\sigma + I(A\rangle B^n X)_\sigma, \tag{25}
\end{align}

where the maximization is over all C, Q, and E and over probability distributions $p_X(x)$ and bipartite
states $\phi_x^{AA'^n}$. The geometric interpretation of the scalarization task is that we are trying to find a
supporting plane of the dynamic capacity region where the weight vector $(w_C, w_Q, w_E)$ is the normal
vector of the plane and the value of its inner product with $(C, Q, E)$ characterizes the offset of the plane.

The Lagrangian of the above optimization problem is

\begin{align}
\mathcal{L}(C, Q, E, p_X(x), \phi_x^{AA'^n}, \lambda_1, \lambda_2, \lambda_3) \equiv{}& w_C C + w_Q Q + w_E E + \lambda_1 \left( I(AX;B^n)_\sigma - (C + 2Q) \right) \\
&+ \lambda_2 \left( I(A\rangle B^n X)_\sigma - (Q + E) \right) \\
&+ \lambda_3 \left( I(X;B^n)_\sigma + I(A\rangle B^n X)_\sigma - (C + Q + E) \right),
\end{align}

and the Lagrange dual function g [7] is

\[ g(\lambda_1, \lambda_2, \lambda_3) \equiv \sup_{C, Q, E, p(x), \phi_x^{AA'^n}} \mathcal{L}(C, Q, E, p_X(x), \phi_x^{AA'^n}, \lambda_1, \lambda_2, \lambda_3), \]

where $\lambda_1, \lambda_2, \lambda_3 \ge 0$. The optimization task is tractable if it is tractable to compute the Lagrange dual
function. Thus, we rewrite the Lagrange dual function as follows:

\begin{align}
g(\lambda_1, \lambda_2, \lambda_3) ={}& \sup_{C, Q, E, p(x), \phi_x^{AA'^n}} w_C C + w_Q Q + w_E E + \lambda_1 \left( I(AX;B^n)_\sigma - (C + 2Q) \right) \\
&+ \lambda_2 \left( I(A\rangle B^n X)_\sigma - (Q + E) \right) + \lambda_3 \left( I(X;B^n)_\sigma + I(A\rangle B^n X)_\sigma - (C + Q + E) \right) \\
={}& \sup_{C, Q, E, p(x), \phi_x^{AA'^n}} (w_C - \lambda_1 - \lambda_3) C + (w_Q - 2\lambda_1 - \lambda_2 - \lambda_3) Q + (w_E - \lambda_2 - \lambda_3) E \\
&+ \lambda_1 \left( I(AX;B^n)_\sigma + \frac{\lambda_2}{\lambda_1} I(A\rangle B^n X)_\sigma + \frac{\lambda_3}{\lambda_1} \left( I(X;B^n)_\sigma + I(A\rangle B^n X)_\sigma \right) \right) \\
={}& \sup_{C, Q, E} \; (w_C - \lambda_1 - \lambda_3) C + (w_Q - 2\lambda_1 - \lambda_2 - \lambda_3) Q + (w_E - \lambda_2 - \lambda_3) E \\
&+ \lambda_1 \left( \max_{p(x), \phi_x^{AA'^n}} I(AX;B^n)_\sigma + \frac{\lambda_2}{\lambda_1} I(A\rangle B^n X)_\sigma + \frac{\lambda_3}{\lambda_1} \left( I(X;B^n)_\sigma + I(A\rangle B^n X)_\sigma \right) \right).
\end{align}

The first equality follows by definition. The second equality follows from some algebra, and the last
follows because the Lagrange dual function factors into two separate optimization tasks: one over C, Q,
and E and another that is equivalent to the quantum dynamic capacity formula with $\lambda = \lambda_2/\lambda_1$ and
$\mu = \lambda_3/\lambda_1$ (for $\lambda_1 > 0$). Thus, the computation of the Pareto optimal trade-off surface is a tractable optimization
task if the quantum dynamic capacity formula in (21) single-letterizes.
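The "some algebra" step in the rewriting of the Lagrange dual function can be checked numerically. The following Python sketch is only an illustration and is not part of the original proof: the function names `lagrangian` and `factored` are hypothetical, and the three entropic quantities are replaced by arbitrary real numbers, since the rearrangement is a purely algebraic identity that holds for any values (with $\lambda_1 > 0$ assumed so that the ratios are defined).

```python
import random


def lagrangian(w, rates, lam, info):
    """Lagrangian in the form it is first written in the text.

    w     = (w_C, w_Q, w_E)       weights of the scalarized objective
    rates = (C, Q, E)             rate triple
    lam   = (lam1, lam2, lam3)    multipliers, lam1 > 0 assumed here
    info  = (i_axb, i_coh, i_xb)  stand-ins for I(AX;B^n), I(A>B^n X), I(X;B^n)
    """
    w_c, w_q, w_e = w
    c, q, e = rates
    l1, l2, l3 = lam
    i_axb, i_coh, i_xb = info
    return (w_c * c + w_q * q + w_e * e
            + l1 * (i_axb - (c + 2 * q))
            + l2 * (i_coh - (q + e))
            + l3 * (i_xb + i_coh - (c + q + e)))


def factored(w, rates, lam, info):
    """Same quantity after grouping: a part linear in (C, Q, E) plus
    lam1 times the dynamic-capacity-type expression with
    lambda = lam2 / lam1 and mu = lam3 / lam1."""
    w_c, w_q, w_e = w
    c, q, e = rates
    l1, l2, l3 = lam
    i_axb, i_coh, i_xb = info
    rate_part = ((w_c - l1 - l3) * c
                 + (w_q - 2 * l1 - l2 - l3) * q
                 + (w_e - l2 - l3) * e)
    formula_part = l1 * (i_axb + (l2 / l1) * i_coh
                         + (l3 / l1) * (i_xb + i_coh))
    return rate_part + formula_part


if __name__ == "__main__":
    rng = random.Random(1)
    w = tuple(rng.uniform(0, 2) for _ in range(3))
    rates = tuple(rng.uniform(-1, 1) for _ in range(3))
    lam = tuple(rng.uniform(0.1, 2) for _ in range(3))
    info = tuple(rng.uniform(0, 2) for _ in range(3))
    print(lagrangian(w, rates, lam, info), factored(w, rates, lam, info))
```

The grouping makes visible why the supremum splits: the rate part does not depend on the input ensemble, and the formula part does not depend on (C, Q, E).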



7.1 Special cases of the quantum dynamic capacity formula 



We now show how several capacity formulas of a quantum channel, including the entanglement-assisted 
classical capacity [5], the Lloyd-Shor-Devetak (LSD) formula for the quantum capacity pTll^ITU] . and 
the Holevo-Schumacher- Westmoreland (HSW) formula for the classical capacity [l4l[24] are special cases 
of the quantum dynamic capacity formula. 

We first give a geometric interpretation of these special cases before proceeding to the proofs. Recall
that the dynamic capacity region has the simple interpretation as a translation of the three-faced unit
resource capacity region along the classically-enhanced father trade-off curve (see Figure 2 for the example
of the region of the dephasing channel). Any particular weight vector $(w_C, w_Q, w_E)$ in (22) gives a set
of parallel planes that slice through the $(C, Q, E)$ space, and the goal of the scalar optimization task is
to find one of these planes that is a supporting plane, intersecting a point (or a set of points) on the
trade-off surface of the dynamic capacity region. We consider three special planes:

1. The first corresponds to the plane containing the vectors of super-dense coding and teleportation. 



The normal vector of this plane is (1, 2, 0), and suppose that we set the weight vector in (22) to
be this vector. Then the optimization program finds the set of points on the trade-off surface such
that a plane with this normal vector is a supporting plane for the region. The optimization program
singles out (23), and we can think of this as being equivalent to setting $\lambda_2, \lambda_3 = 0$ in the Lagrange
dual function. We show below that the optimization program becomes equivalent to finding the 
entanglement-assisted capacity [S] , in the sense that the quantum dynamic capacity formula becomes 
the entanglement-assisted capacity formula. 






2. The next plane contains the vectors of teleportation and entanglement distribution. The normal vector
of this plane is (0, 1, 1). Setting the weight vector in (22) to be this vector makes the optimization
program single out (24), and we can think of this as being equivalent to setting $\lambda_1, \lambda_3 = 0$ in the
Lagrange dual function. We show below that the optimization program becomes equivalent to finding 
the quantum capacity pTlf25l[TU] . in the sense that the quantum dynamic capacity formula becomes 
the LSD formula for the quantum capacity. 

3. A third plane contains the vectors of super-dense coding and entanglement distribution. The normal
vector of this plane is (1, 1, 1). Setting the weight vector in (22) to be this vector makes the optimization
program single out (25), and we can think of this as being equivalent to setting $\lambda_1, \lambda_2 = 0$ in
the Lagrange dual function. We show below that the optimization becomes equivalent to finding the 
classical capacity pnESHTU] . in the sense that the quantum dynamic capacity formula becomes the 
HSW formula for the classical capacity. 

Corollary 1 The quantum dynamic capacity formula is equivalent to the entanglement-assisted classical
capacity formula when $\lambda, \mu = 0$, in the sense that

\[ \max_{\sigma} I(AX;B) = \max_{\phi^{AA'}} I(A;B). \]

Proof The inequality $\max_\sigma I(AX;B) \ge \max_{\phi^{AA'}} I(A;B)$ follows because the state $\sigma$ is of the form in
(14) and we can always choose $p_X(x) = \delta_{x,x_0}$ and $\phi_{x_0}^{AA'}$ to be the state that maximizes $I(A;B)$.

We now show the other inequality $\max_\sigma I(AX;B) \le \max_{\phi^{AA'}} I(A;B)$. First, consider that the
following chain of equalities holds for any state $\phi^{ABE}$ resulting from the isometric extension of the
channel:

\begin{align}
I(A;B) &= H(B) + H(A) - H(AB) \\
&= H(B) + H(BE) - H(E) \\
&= H(B) + H(B|E).
\end{align}

In this way, we see that the mutual information is purely a function of the channel input density operator
$\mathrm{Tr}_A\{\phi^{AA'}\}$. Then consider any state $\sigma$ of the form in (14). The following chain of inequalities holds:

\begin{align}
I(AX;B)_\sigma &= H(A|X)_\sigma + H(B)_\sigma - H(E|X)_\sigma \\
&= H(BE|X)_\sigma + H(B)_\sigma - H(E|X)_\sigma \\
&= H(B|EX)_\sigma + H(B)_\sigma \\
&\le H(B|E)_\sigma + H(B)_\sigma \\
&\le \max_{\phi^{AA'}} I(A;B).
\end{align}

The first equality follows by expanding the mutual information. The second equality follows because the
state on ABE is pure when conditioned on X. The third equality follows from the entropy chain rule.
The first inequality follows from strong subadditivity, and the last follows because the state after tracing
out systems X and A is a particular state that arises from the channel, so that the quantity cannot be larger than the
maximum.

Corollary 2 The quantum dynamic capacity formula is equivalent to the LSD quantum capacity formula
in the limit where $\lambda \to \infty$ and $\mu$ is fixed, in the sense that

\[ \max_{\sigma} I(A\rangle BX) = \max_{\phi^{AA'}} I(A\rangle B). \]

Proof The inequality $\max_\sigma I(A\rangle BX) \ge \max_{\phi^{AA'}} I(A\rangle B)$ follows because the state $\sigma$ is of the form in
(14) and we can always choose $p_X(x) = \delta_{x,x_0}$ and $\phi_{x_0}^{AA'}$ to be the state that maximizes $I(A\rangle B)$.

The inequality $\max_\sigma I(A\rangle BX) \le \max_{\phi^{AA'}} I(A\rangle B)$ follows because $I(A\rangle BX) = \sum_x p_X(x) I(A\rangle B)_{\phi_x}$
and the maximum is never less than the average.

Corollary 3 The quantum dynamic capacity formula is equivalent to the HSW classical capacity formula
in the limit where $\mu \to \infty$ and $\lambda$ is fixed, in the sense that

\[ \max_{\sigma} I(A\rangle BX)_\sigma + I(X;B)_\sigma = \max_{\{p_X(x), \psi_x\}} I(X;B). \]

Proof The inequality $\max_\sigma I(A\rangle BX)_\sigma + I(X;B)_\sigma \ge \max_{\{p_X(x), \psi_x\}} I(X;B)$ follows by choosing $\sigma$ to be
the pure ensemble that maximizes $I(X;B)$ and noting that $I(A\rangle BX)_\sigma$ vanishes for a pure ensemble.

We now prove the inequality $\max_\sigma I(A\rangle BX)_\sigma + I(X;B)_\sigma \le \max_{\{p_X(x), \psi_x\}} I(X;B)$. Consider a state
$\omega^{XYBE}$ obtained by performing a von Neumann measurement on the A system of the state $\sigma^{XABE}$. Then

\begin{align}
I(A\rangle BX)_\sigma + I(X;B)_\sigma &= H(B)_\sigma - H(E|X)_\sigma \\
&= H(B)_\omega - H(E|X)_\omega \\
&\le H(B)_\omega - H(E|XY)_\omega \\
&= H(B)_\omega - H(B|XY)_\omega \\
&= I(XY;B)_\omega \\
&\le \max_{\{p_X(x), \psi_x\}} I(X;B).
\end{align}

The first equality follows by expanding the conditional coherent information and the Holevo information.
The second equality follows because the measured A system is not involved in the entropies. The first
inequality follows because conditioning does not increase entropy. The third equality follows because the
state $\omega$ is pure when conditioned on X and Y. The fourth equality follows by definition, and the last
inequality follows because $\omega$, with X and Y regarded as a single classical variable, is a particular ensemble of pure states.



8 Single-letter dynamic capacity region for the quantum Hadamard channels 

Below we show that the regularization in ([s]) is not necessary if the quantum channel is a Hadamard 
channel. This result holds because a Hadamard channel has a special structure. The development of the 
proof is similar to that in Ref. , but simplified because we obtain the single-letter result more directly. 

Theorem 3 The dynamic capacity region $\mathcal{C}_{\mathrm{CQE}}(\mathcal{N}_H)$ of a quantum Hadamard channel $\mathcal{N}_H$ is equal to
its one-shot region $\mathcal{C}^{(1)}_{\mathrm{CQE}}(\mathcal{N}_H)$.

The proof of the above theorem follows in two parts: 1) the lemma below shows that the quantum dynamic
capacity formula is additive when one of the channels is Hadamard and 2) the induction argument in
Lemma 1 proves single-letterization.

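Lemma 2 The following additivity relation holds for a Hadamard channel $\mathcal{N}_H$ and any other channel
$\mathcal{N}$:

\[ D_{\lambda,\mu}(\mathcal{N}_H \otimes \mathcal{N}) = D_{\lambda,\mu}(\mathcal{N}_H) + D_{\lambda,\mu}(\mathcal{N}). \]

Proof We first note that the inequality $D_{\lambda,\mu}(\mathcal{N}_H \otimes \mathcal{N}) \ge D_{\lambda,\mu}(\mathcal{N}_H) + D_{\lambda,\mu}(\mathcal{N})$ holds for any two channels
simply by selecting the state $\sigma$ in the maximization to be a tensor product of the ones that individually
maximize $D_{\lambda,\mu}(\mathcal{N}_H)$ and $D_{\lambda,\mu}(\mathcal{N})$.

So we prove that the non-trivial inequality $D_{\lambda,\mu}(\mathcal{N}_H \otimes \mathcal{N}) \le D_{\lambda,\mu}(\mathcal{N}_H) + D_{\lambda,\mu}(\mathcal{N})$ holds when the first
channel is a Hadamard channel. Since the first channel is Hadamard, it is degradable and its degrading
map has a particular structure: there are maps $\mathcal{D}_1^{B_1 \to Y}$ and $\mathcal{D}_2^{Y \to E_1}$, where Y is a classical register,
such that the degrading map is $\mathcal{D}_2^{Y \to E_1} \circ \mathcal{D}_1^{B_1 \to Y}$ [^[TS]. Suppose the state we are considering to input
to the tensor product channel is

\[ \rho^{XAA_1'A_2'} \equiv \sum_x p_X(x) \, |x\rangle\langle x|^X \otimes \phi_x^{AA_1'A_2'}, \]

and this state is the one that maximizes $D_{\lambda,\mu}(\mathcal{N}_H \otimes \mathcal{N})$. Suppose that the output of the first channel is

\[ \theta^{XAB_1E_1A_2'} \equiv U_{\mathcal{N}_H}^{A_1' \to B_1E_1}(\rho^{XAA_1'A_2'}), \]

and the output of the second channel is

\[ \omega^{XAB_1E_1B_2E_2} \equiv U_{\mathcal{N}}^{A_2' \to B_2E_2}(\theta^{XAB_1E_1A_2'}). \]

Finally, we define the following state as the result of applying the first part of the Hadamard degrading
map (a von Neumann measurement) to $\omega$:

\[ \sigma^{XYAE_1B_2E_2} \equiv \mathcal{D}_1^{B_1 \to Y}(\omega^{XAB_1E_1B_2E_2}). \]

In particular, the state $\sigma$ on systems $AE_1B_2E_2$ is pure when conditioned on X and Y. Then the following
chain of inequalities holds:

\begin{align}
D_{\lambda,\mu}(\mathcal{N}_H \otimes \mathcal{N}) ={}& I(AX;B_1B_2)_\omega + \lambda I(A\rangle B_1B_2X)_\omega + \mu \left( I(X;B_1B_2)_\omega + I(A\rangle B_1B_2X)_\omega \right) \\
={}& H(B_1B_2E_1E_2|X)_\omega + \lambda H(B_1B_2|X)_\omega + (\mu+1) H(B_1B_2)_\omega - (\lambda+\mu+1) H(E_1E_2|X)_\omega \\
={}& H(B_1E_1|X)_\omega + \lambda H(B_1|X)_\omega + (\mu+1) H(B_1)_\omega - (\lambda+\mu+1) H(E_1|X)_\omega \\
&+ H(B_2E_2|B_1E_1X)_\omega + \lambda H(B_2|B_1X)_\omega + (\mu+1) H(B_2|B_1)_\omega - (\lambda+\mu+1) H(E_2|E_1X)_\omega \\
\le{}& H(B_1E_1|X)_\theta + \lambda H(B_1|X)_\theta + (\mu+1) H(B_1)_\theta - (\lambda+\mu+1) H(E_1|X)_\theta \\
&+ H(B_2E_2|YX)_\sigma + \lambda H(B_2|YX)_\sigma + (\mu+1) H(B_2)_\sigma - (\lambda+\mu+1) H(E_2|YX)_\sigma \\
={}& I(AA_2'X;B_1)_\theta + \lambda I(AA_2'\rangle B_1X)_\theta + \mu \left( I(X;B_1)_\theta + I(AA_2'\rangle B_1X)_\theta \right) \\
&+ I(AE_1YX;B_2)_\sigma + \lambda I(AE_1\rangle B_2YX)_\sigma + \mu \left( I(YX;B_2)_\sigma + I(AE_1\rangle B_2YX)_\sigma \right) \\
\le{}& D_{\lambda,\mu}(\mathcal{N}_H) + D_{\lambda,\mu}(\mathcal{N}).
\end{align}

The first equality follows by evaluating the quantum dynamic capacity formula $D_{\lambda,\mu}(\mathcal{N}_H \otimes \mathcal{N})$ on the
state $\rho$. The next two equalities follow by rearranging entropies and because the state $\omega$ on systems
$AB_1E_1B_2E_2$ is pure when conditioned on X. The inequality in the middle is the crucial one and follows
from the Hadamard structure of the channel: we exploit monotonicity of conditional entropy under
quantum operations so that $H(B_2|B_1X)_\omega \le H(B_2|YX)_\sigma$ and $H(E_2|YX)_\sigma \le H(E_2|E_1X)_\omega$. The next
equality follows by rearranging entropies and the final inequality follows because $\theta$ is a state of the form (14)
for the first channel while $\sigma$ is a state of the form (14) for the second channel.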

9 The dynamic capacity region of a dephasing channel 

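The theorem below shows that the full dynamic capacity region admits a particularly simple form when
the noisy quantum channel is a qubit dephasing channel $\Delta_p$ where

\begin{align}
\Delta_p(\rho) &\equiv (1-p)\rho + p\Delta(\rho), \\
\Delta(\rho) &\equiv \langle 0|\rho|0\rangle \, |0\rangle\langle 0| + \langle 1|\rho|1\rangle \, |1\rangle\langle 1|.
\end{align}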

A dephasing channel is an example of a quantum Hadamard channel 9J Figure [2] plots this region for 
the case of a dephasing channel with dephasing parameter p = 0.2. The proof of the following theorem 
exploits the same techniques as in Ref. [S]. 

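Theorem 4 The dynamic capacity region $\mathcal{C}_{\mathrm{CQE}}(\Delta_p)$ of a dephasing channel with dephasing parameter
p is the set of all C, Q, and E such that

\begin{align}
C + 2Q &\le 1 + H_2(\nu) - H_2(\gamma(\nu,p)), \tag{26}\\
Q + E &\le H_2(\nu) - H_2(\gamma(\nu,p)), \tag{27}\\
C + Q + E &\le 1 - H_2(\gamma(\nu,p)), \tag{28}
\end{align}

where $\nu \in [0, 1/2]$, $H_2$ is the binary entropy function, and

\[ \gamma(\nu,p) \equiv \frac{1}{2} + \frac{1}{2} \sqrt{1 - 16 \cdot \frac{p}{2} \left( 1 - \frac{p}{2} \right) \nu (1 - \nu)}. \]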




Brádler showed that cloning channels and an Unruh channel are also in the Hadamard class [5].

Fig. 2 (Color online) A plot of the dynamic capacity region for a qubit dephasing channel with dephasing parameter 
p = 0.2. The plot shows that the classically-enhanced father (CEF) trade-off curve lies along the boundary of the dynamic 
capacity region. The rest of the region is simply the combination of the CEF points with the unit protocols teleportation 
(TP), super-dense coding (SD), and entanglement distribution (ED). 



Proof We first notice that it suffices to consider an ensemble of pure states whose reductions to A' are
diagonal in the dephasing basis (as in Lemma 11 of Ref. [S]). Next we prove below that it is sufficient to 
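consider an ensemble of the following form to characterize the boundary points of the region:

\[ \frac{1}{2} |0\rangle\langle 0|^X \otimes \psi_0^{AA'} + \frac{1}{2} |1\rangle\langle 1|^X \otimes \psi_1^{AA'}, \tag{29} \]

where $\psi_0^{AA'}$ and $\psi_1^{AA'}$ are pure states, defined as follows for $\nu \in [0, 1/2]$:

\begin{align}
\mathrm{Tr}_A\{\psi_0^{AA'}\} &= \nu \, |0\rangle\langle 0|^{A'} + (1-\nu) \, |1\rangle\langle 1|^{A'}, \tag{30}\\
\mathrm{Tr}_A\{\psi_1^{AA'}\} &= (1-\nu) \, |0\rangle\langle 0|^{A'} + \nu \, |1\rangle\langle 1|^{A'}. \tag{31}
\end{align}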



We now prove the above claim. We assume without loss of generality that the dephasing basis is the
computational basis. Consider a classical-quantum state with a finite number N of conditional density
operators $\phi_x^{AA'}$ whose reduction to A' is diagonal:

\[ \rho^{XAA'} \equiv \sum_{x=0}^{N-1} p_X(x) \, |x\rangle\langle x|^X \otimes \phi_x^{AA'}. \]

We can form a new classical-quantum state with double the number of conditional density operators by
"bit-flipping" the original conditional density operators:

\[ \sigma^{XAA'} \equiv \frac{1}{2} \sum_{x=0}^{N-1} p_X(x) \left( |x\rangle\langle x|^X \otimes \phi_x^{AA'} + |x+N\rangle\langle x+N|^X \otimes X^{A'} \phi_x^{AA'} X^{A'} \right), \]

where X is the $\sigma_X$ "bit-flip" Pauli operator. Consider the following chain of inequalities that holds for
all $\lambda, \mu \ge 0$:

\begin{align}
& I(AX;B)_\rho + \lambda I(A\rangle BX)_\rho + \mu \left( I(X;B)_\rho + I(A\rangle BX)_\rho \right) \\
&= H(A|X)_\rho + (\mu+1) H(B)_\rho + \lambda H(B|X)_\rho - (\lambda+\mu+1) H(E|X)_\rho \\
&= (\mu+1) H(B)_\rho + H(A|X)_\sigma + \lambda H(B|X)_\sigma - (\lambda+\mu+1) H(E|X)_\sigma \\
&\le (\mu+1) H(B)_\sigma + H(A|X)_\sigma + \lambda H(B|X)_\sigma - (\lambda+\mu+1) H(E|X)_\sigma \\
&= (\mu+1) + H(A|X)_\sigma + \lambda H(B|X)_\sigma - (\lambda+\mu+1) H(E|X)_\sigma \\
&= (\mu+1) + \sum_x p_X(x) \left[ H(A)_{\phi_x} + \lambda H(B)_{\phi_x} - (\lambda+\mu+1) H(E)_{\phi_x} \right] \\
&\le (\mu+1) + \max_x \left[ H(A)_{\phi_x} + \lambda H(B)_{\phi_x} - (\lambda+\mu+1) H(E)_{\phi_x} \right] \\
&= (\mu+1) + H(A)_{\phi_*} + \lambda H(B)_{\phi_*} - (\lambda+\mu+1) H(E)_{\phi_*}.
\end{align}

The first equality follows by standard entropic manipulations. The second equality follows because the
conditional entropy $H(B|X)$ is invariant under a bit-flipping unitary on the input state that commutes
with the channel: $H(B)_{X\rho_x^B X} = H(B)_{\rho_x^B}$. Furthermore, a bit flip on the input state does not change
the eigenvalues for the output of the dephasing channel's complementary channel:
$H(E)_{\mathcal{N}^c(X\rho_x^{A'}X)} = H(E)_{\mathcal{N}^c(\rho_x^{A'})}$. The first inequality follows because entropy is concave, i.e., the local state
$\sigma^B$ is a mixed version of $\rho^B$. The third equality follows because

\[ H(B)_{\sigma^B} = H\left( \sum_x \frac{1}{2} p_X(x) \left( \rho_x^B + X\rho_x^B X \right) \right) = H\left( \frac{1}{2} \sum_x p_X(x) \, I \right) = 1. \]

The fourth equality follows because the system X is classical. The second inequality follows because the
maximum value of a realization of a random variable is not less than its expectation. The final equality
simply follows by defining $\phi_*$ to be the conditional density operator on systems A, B, and E that arises
from sending through the channel a state whose reduction to A' is of the form $\nu \, |0\rangle\langle 0|^{A'} + (1-\nu) \, |1\rangle\langle 1|^{A'}$.
Thus, an ensemble of the kind in (29) is sufficient to attain a point on the boundary of the region.



Evaluating the entropic quantities in Theorem 1 on a state of the above form then gives the expression
for the region in Theorem 4.
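The evaluation just described can be sketched numerically. The Python snippet below is an illustration under stated assumptions rather than the authors' code: the function names `h2`, `gamma`, and `dephasing_bounds` are hypothetical, and the closed form used for $\gamma(\nu,p)$ is the larger eigenvalue of the environment state, obtained by writing the dephasing channel with parameter p as a phase flip applied with probability p/2 to an input whose reduction is $\nu|0\rangle\langle 0| + (1-\nu)|1\rangle\langle 1|$.

```python
import math


def h2(x):
    """Binary entropy in bits, with h2(0) = h2(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def gamma(nu, p):
    """Larger eigenvalue of the environment state for an input with
    reduced state diag(nu, 1 - nu) (assumed closed form, see lead-in)."""
    q = p / 2  # phase-flip probability equivalent to dephasing parameter p
    disc = 1 - 16 * q * (1 - q) * nu * (1 - nu)
    return 0.5 + 0.5 * math.sqrt(max(0.0, disc))


def dephasing_bounds(nu, p):
    """Right-hand sides of (26), (27), (28) for given nu and p."""
    h_env = h2(gamma(nu, p))
    return (1 + h2(nu) - h_env,  # bound on C + 2Q
            h2(nu) - h_env,      # bound on Q + E
            1 - h_env)           # bound on C + Q + E


if __name__ == "__main__":
    # Sweep nu over [0, 1/2] for the dephasing parameter used in Figure 2.
    for k in range(6):
        nu = 0.1 * k
        print(nu, dephasing_bounds(nu, 0.2))
```

At $\nu = 1/2$ the bound on Q + E reduces to $1 - H_2(p/2)$, and at $\nu = 0$ the three bounds are 1, 0, and 1, which correspond to the two end points of the trade-off curve.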



10 Single-letter dynamic capacity region for the quantum erasure channels 

Below we show that the regularization in Q is not necessary if the quantum channel is a quantum erasure 
channel. The quantum erasure channel also has a special structure, but the proof proceeds differently 
from that for a quantum Hadamard channel. 

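A quantum erasure channel with erasure parameter $\epsilon$ is the following map [13]:

\[ \mathcal{N}_\epsilon(\rho) \equiv (1-\epsilon)\rho + \epsilon \, |e\rangle\langle e|. \]

Notice that the receiver can perform a measurement $\{|0\rangle\langle 0| + |1\rangle\langle 1|, |e\rangle\langle e|\}$ and can learn whether the
channel erased the state. The receiver can do this without disturbing the state in any way. An isometric
extension $U_{\mathcal{N}_\epsilon}^{A' \to BE}$ of it acts as follows on a purification $|\psi\rangle^{AA'}$ of the state $\rho^{A'}$:

\[ U_{\mathcal{N}_\epsilon}^{A' \to BE} |\psi\rangle^{AA'} = \sqrt{1-\epsilon} \, |\psi\rangle^{AB} |e\rangle^E + \sqrt{\epsilon} \, |\psi\rangle^{AE} |e\rangle^B. \]

In the above representation, we see that the erasure channel has the interpretation that it hands the
input to Bob with probability $1-\epsilon$ while giving an erasure flag $|e\rangle$ to Eve, and it hands the input to Eve
with probability $\epsilon$ while giving the erasure flag to Bob.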

Theorem 5 The dynamic capacity region $\mathcal{C}_{\mathrm{CQE}}(\mathcal{N}_\epsilon)$ of a quantum erasure channel $\mathcal{N}_\epsilon$ is the set of all
C, Q, and E such that

\begin{align}
C + 2Q &\le (1-\epsilon)(1 + H_2(p)), \\
Q + E &\le (1-2\epsilon) H_2(p), \\
C + Q + E &\le 1 - \epsilon - \epsilon H_2(p),
\end{align}

where $p \in [0, 1/2]$.

Fig. 3 (Color online) A plot of the dynamic capacity region for a qubit erasure channel with erasure parameter $\epsilon = 1/4$.
The plot shows that the classically-enhanced father (CEF) trade-off curve lies along the boundary of the dynamic capacity 
region and it is not actually a curve but rather a line because time-sharing is optimal. The rest of the region is simply the 
combination of the CEF points with the unit protocols teleportation (TP), super-dense coding (SD), and entanglement 
distribution (ED). 



Figure 3 plots the dynamic capacity region of a quantum erasure channel with erasure parameter
$\epsilon = 1/4$. It turns out that time-sharing is the optimal strategy here, and there is not an interesting
trade-off curve for the quantum erasure channel.

We in fact proved Theorem 5 in Ref. [17] by employing a reductio ad absurdum argument reminiscent
of the earliest arguments for proving capacities of quantum erasure channels [3]. This approach gives the
correct answer, but suffers from two shortcomings:

1. We do not learn much about how to exploit the structure of the quantum erasure channel with the
reductio ad absurdum approach. As an example, Smith and Yard exploited the simple structure of the
quantum erasure channel and discovered far-reaching consequences [26]. In particular, they discovered
that the quantum capacity (and for that matter, any future proposed quantum capacity formula) can
never be generally additive, by combining the erasure channel with another one.

2. The reductio ad absurdum argument rests on the assumption that several known capacity formulas
are continuous as a function of channels. Leung and Smith later showed that the known formulas are
indeed continuous [19], redeeming the original argument in Ref. [3].

Here, we prove Theorem 5 above by carefully studying the structure of the quantum erasure channel
and its additivity properties for the full dynamic capacity region. We prove the theorem in a few steps.
First, we prove that the classical capacity of the quantum erasure channel admits a single-letter formula.
We then simplify the quantum dynamic capacity formula in (21) for the case of a quantum erasure channel
and find that it is only necessary to consider certain values of the parameters $\mu$ and $\lambda$ when we optimize.
The proof of Lemma 7 exploits these conditions to show that the quantum dynamic capacity formula is
additive for the case of two quantum erasure channels. It then follows by a trivial induction step (the
same as in Lemma 1) that the full dynamic region single-letterizes and is of the form in Theorem 5.

Lemma 3 (Bennett et al. [4]) The Holevo information of a quantum erasure channel is equal to $1-\epsilon$:

\[ \chi(\mathcal{N}_\epsilon) \equiv \max_{\rho} I(X;B) = 1 - \epsilon. \]

Proof (The proof already appears in Ref. [4], but it again suffers from the aforementioned shortcomings.)
We begin with an input ensemble of the following form:

\[ \rho^{XA'} \equiv \sum_x p_X(x) \, |x\rangle\langle x|^X \otimes \phi_x^{A'}, \]

where the states $\phi_x^{A'}$ are pure (it suffices to consider pure state ensembles for classical capacity).
Feeding the A' system into the quantum erasure channel leads to a classical-quantum state
$\sigma^{XB} = \sum_x p_X(x) \, |x\rangle\langle x|^X \otimes \mathcal{N}_\epsilon(\phi_x^{A'})$. Bob can then measure his system B to learn whether the channel
erases the qubit. Let $X_e$ denote a classical register where Bob places the result of the measurement so
that we then have the state $\sigma^{XBX_e}$. It holds that any entropy evaluated on system B is equal to the
joint entropy of B and $X_e$ because this measurement does not disturb the state in any way. Consider
the following chain of inequalities:

\begin{align}
I(X;B)_\sigma &= H(B)_\sigma - H(B|X)_\sigma \\
&= H(BX_e)_\sigma - H(BX_e|X)_\sigma \\
&= H(B|X_e)_\sigma - H(B|X_eX)_\sigma \\
&= (1-\epsilon) H(A')_\rho - (1-\epsilon) H(A'|X)_\rho \\
&= (1-\epsilon) I(X;A')_\rho \\
&\le 1-\epsilon.
\end{align}

The first equality follows by expanding the mutual information and the second equality follows from
the above fact regarding the joint entropy of B and $X_e$. The third equality follows by expanding and
canceling terms (the erasure flag $X_e$ is independent of X, so that $H(X_e) = H(X_e|X)$). The fourth equality
follows by conditioning on the classical erasure flag register $X_e$ and
realizing that the entropy of Bob's system B is the entropy of the input state with probability $1-\epsilon$ and
otherwise is the entropy of the erasure state $|e\rangle$ (this latter entropy vanishes because this state is pure).
Thus, the above sequence of steps reduces an optimization problem on the output of the channel to a
simple optimization over the input ensemble. The Holevo information is then $1-\epsilon$ because the quantity
$I(X;A')_\rho$ can never be larger than unity for the case of a qubit erasure channel (it reaches unity for an
ensemble of two orthogonal pure states chosen with equal probability).
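The reduction $I(X;B)_\sigma = (1-\epsilon) I(X;A')_\rho$ can be checked directly for the simplest ensembles. The sketch below is illustrative only and makes a simplifying assumption not required by the lemma: the input ensemble consists of two orthogonal pure qubit states sent with probabilities t and 1 - t, so that every output state is diagonal in a fixed basis and all entropies are Shannon entropies of explicit eigenvalue lists. The function names `shannon` and `holevo_erasure` are hypothetical.

```python
import math


def shannon(probs):
    """Shannon entropy in bits of a list of probabilities (zeros are skipped)."""
    return -sum(x * math.log2(x) for x in probs if x > 0.0)


def holevo_erasure(eps, t):
    """Holevo information I(X;B) at the output of a qubit erasure channel
    for two orthogonal pure input states sent with probabilities t and 1 - t.

    Average output state: eigenvalues (1-eps)*t, (1-eps)*(1-t), eps.
    Each conditional output state: eigenvalues 1-eps and eps.
    """
    h_b = shannon([(1 - eps) * t, (1 - eps) * (1 - t), eps])
    h_b_given_x = shannon([1 - eps, eps])
    return h_b - h_b_given_x


if __name__ == "__main__":
    eps = 0.25
    for t in (0.1, 0.3, 0.5):
        # Compare with (1 - eps) * I(X;A'), where I(X;A') is the entropy of (t, 1 - t).
        print(t, holevo_erasure(eps, t), (1 - eps) * shannon([t, 1 - t]))
```

The entropy of the erasure flag appears in both terms and cancels, which is the numerical counterpart of the third equality in the chain above.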

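Lemma 4 The following additivity lemma holds for a quantum erasure channel $\mathcal{N}_\epsilon$:

\[ \chi(\mathcal{N}_\epsilon \otimes \mathcal{N}_\epsilon) = \chi(\mathcal{N}_\epsilon) + \chi(\mathcal{N}_\epsilon) = 2(1-\epsilon). \]

Proof It suffices to prove the inequality $\chi(\mathcal{N}_\epsilon \otimes \mathcal{N}_\epsilon) \le \chi(\mathcal{N}_\epsilon) + \chi(\mathcal{N}_\epsilon)$ because the other inequality holds
trivially. We define the following ensemble of states for the tensor product channel $\mathcal{N}_\epsilon \otimes \mathcal{N}_\epsilon$:

\[ \rho^{XA_1'A_2'} \equiv \sum_x p_X(x) \, |x\rangle\langle x|^X \otimes \phi_x^{A_1'A_2'}. \tag{33} \]

We can also define the following augmented ensemble based on the above one:

\[ \sigma^{XIJA_1'A_2'} \equiv \sum_x \sum_{i,j=0}^{3} \frac{1}{16} p_X(x) \, |x\rangle\langle x|^X \otimes |i\rangle\langle i|^I \otimes |j\rangle\langle j|^J \otimes \left( \sigma_i^{A_1'} \otimes \sigma_j^{A_2'} \right) \phi_x^{A_1'A_2'} \left( \sigma_i^{A_1'} \otimes \sigma_j^{A_2'} \right), \]

where $\sigma_0 = I$, $\sigma_1 = \sigma_X$, $\sigma_2 = \sigma_Y$, and $\sigma_3 = \sigma_Z$. In particular, note that we obtain the maximally
mixed state when tracing over classical registers I and J. Let $\omega^{XB_1B_2}$ and $\theta^{XIJB_1B_2}$ denote the states
obtained by sending systems $A_1'$ and $A_2'$ of the respective states $\rho^{XA_1'A_2'}$ and $\sigma^{XIJA_1'A_2'}$ through two
uses of the quantum erasure channel. Let $X_{e,1}$ and $X_{e,2}$ denote the classical variables Bob obtains by
determining whether the channel erased his states (they also denote the registers where he places the
results). Consider the following chain of inequalities that holds for any state $\omega^{XB_1B_2}$:

\begin{align}
I(X;B_1B_2)_\omega ={}& H(B_1B_2)_\omega - H(B_1B_2|X)_\omega \\
={}& H(B_1B_2|X_{e,1}X_{e,2})_\omega - H(B_1B_2|X_{e,1}X_{e,2}X)_\omega \\
={}& (1-\epsilon)^2 H(A_1'A_2')_\rho + (1-\epsilon)\epsilon \left( H(A_1')_\rho + H(A_2')_\rho \right) \\
&- (1-\epsilon)^2 H(A_1'A_2'|X)_\rho - (1-\epsilon)\epsilon \left( H(A_1'|X)_\rho + H(A_2'|X)_\rho \right) \\
\le{}& 2(1-\epsilon) - (1-\epsilon)^2 H(A_1'A_2'|XIJ)_\sigma - (1-\epsilon)\epsilon \left( H(A_1'|XIJ)_\sigma + H(A_2'|XIJ)_\sigma \right) \\
\le{}& 2(1-\epsilon).
\end{align}

The first equality holds by expanding the mutual information. The next equality holds because an
"erasure measurement" does not change entropy. The third equality follows by exploiting the properties
of the erasure channels. The first inequality holds because the unconditional entropies of the A' systems
on the state $\rho$ are never larger than those for the state $\sigma$, while the conditional entropies are unchanged by the Pauli unitaries. The final inequality follows because the entropies
in the previous line are non-negative. The statement of the lemma then follows.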



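Lemma 5 The quantum dynamic capacity formula in (21) simplifies as follows for a quantum erasure
channel $\mathcal{N}_\epsilon$:

\[ D_{\lambda,\mu}(\mathcal{N}_\epsilon) = \max_{p \in [0,1/2]} \; (1-\epsilon)(1 + H_2(p)) + \lambda (1-2\epsilon) H_2(p) + \mu \left( (1-\epsilon) - \epsilon H_2(p) \right). \tag{34} \]

Thus, the "one-letter" dynamic capacity region of a quantum erasure channel is as Theorem 5 states.

Proof We exploit the following classical-quantum states:

\begin{align}
\rho^{XAA'} &\equiv \sum_x p_X(x) \, |x\rangle\langle x|^X \otimes \phi_x^{AA'}, \\
\sigma^{XIAA'} &\equiv \sum_x \sum_{i=0}^{3} \frac{1}{4} p_X(x) \, |x\rangle\langle x|^X \otimes |i\rangle\langle i|^I \otimes \sigma_i^{A'} \phi_x^{AA'} \sigma_i^{A'},
\end{align}

and let $\rho^{XABE}$ and $\sigma^{XIABE}$ be the states obtained by transmitting the A' system through the isometric
extension of the erasure channel. Let $\phi_x^{A'} \equiv \mathrm{Tr}_A\{\phi_x^{AA'}\}$. Furthermore, let the eigenvalues of the state
$\phi_*^{A'}$ with highest entropy on system A' be p and $1-p$. Consider that the following chain of inequalities
holds for any state $\rho^{XABE}$:

\begin{align}
& I(AX;B)_\rho + \lambda I(A\rangle BX)_\rho + \mu \left[ I(X;B)_\rho + I(A\rangle BX)_\rho \right] \\
&= H(A|X)_\rho + (\mu+1) H(B)_\rho + \lambda H(B|X)_\rho - (\lambda+\mu+1) H(E|X)_\rho \\
&= H(A'|X)_\rho + (\mu+1) H(B|X_e)_\rho + \lambda H(B|X_eX)_\rho - (\lambda+\mu+1) H(E|X_eX)_\rho \\
&= H(A'|X)_\rho + (\mu+1)(1-\epsilon) H(A')_\rho + \lambda (1-\epsilon) H(A'|X)_\rho - (\lambda+\mu+1) \epsilon H(A'|X)_\rho \\
&\le (\mu+1)(1-\epsilon) + \left( 1 + \lambda(1-\epsilon) - (\lambda+\mu+1)\epsilon \right) H(A'|XI)_\sigma \\
&= (\mu+1)(1-\epsilon) + \left( 1 - \epsilon + \lambda(1-2\epsilon) - \mu\epsilon \right) \sum_x p_X(x) H(A')_{\phi_x} \\
&\le (\mu+1)(1-\epsilon) + \left( 1 - \epsilon + \lambda(1-2\epsilon) - \mu\epsilon \right) H(A')_{\phi_*} \\
&= (1-\epsilon)(1 + H_2(p)) + \mu \left( 1 - \epsilon - \epsilon H_2(p) \right) + \lambda (1-2\epsilon) H_2(p).
\end{align}

The first equality follows by standard entropic manipulations. The second equality follows by incorporating
the classical erasure flag variable. The third equality follows by exploiting the properties of the
quantum erasure channel. The first inequality follows because the unconditional entropy of the state $\rho$ is
never larger than that of the state $\sigma$. The next equality follows by expanding the conditional entropy. The
second inequality follows because an average is never larger than a maximum. The final equality follows
by plugging in the eigenvalues of $\phi_*^{A'}$. The form of the quantum dynamic capacity formula then follows
because this chain of inequalities holds for any input ensemble.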

Lemma 6 It suffices to consider the set of $\lambda, \mu \ge 0$ for which

\[ (1-\epsilon) + \lambda(1-2\epsilon) \ge \mu\epsilon. \]

Otherwise, we are just maximizing the classical capacity, which we know from Lemma 3 is equal to $1-\epsilon$.

Proof Consider rewriting the expression in (34) as follows:

\[ \max_{p \in [0,1/2]} \; (1-\epsilon) + \mu(1-\epsilon) + \left[ (1-\epsilon) + \lambda(1-2\epsilon) - \mu\epsilon \right] H_2(p). \]

Suppose that the expression in square brackets is negative, i.e.,

\[ (1-\epsilon) + \lambda(1-2\epsilon) < \mu\epsilon. \]

Then the maximization over p simply chooses p = 0 so that $H_2(p)$ vanishes and the negative term
disappears. The resulting expression for the quantum dynamic capacity formula is

\[ (1-\epsilon) + \mu(1-\epsilon), \]

which corresponds to the following region:

\begin{align}
C + 2Q &\le 1-\epsilon, \\
Q + E &\le 0, \\
C + Q + E &\le 1-\epsilon.
\end{align}

The above region is equivalent to a translation of the unit resource capacity region to the classical
capacity rate triple $(1-\epsilon, 0, 0)$. Thus, it suffices to restrict the parameters $\lambda$ and $\mu$ as above for the
quantum erasure channel.
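The case distinction in Lemma 6 can be illustrated with a short numerical sketch. The code below is an assumption-laden illustration, not the authors' method: it evaluates the expression in (34) by brute-force search over a grid of p values (the function names `h2` and `d_erasure` and the grid size are hypothetical choices). Because the expression is linear in $H_2(p)$, the maximum sits at p = 0 when the bracketed coefficient is negative and at p = 1/2 otherwise.

```python
import math


def h2(x):
    """Binary entropy in bits, with h2(0) = h2(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)


def d_erasure(lam, mu, eps, grid=2001):
    """Grid-search evaluation of the expression in (34) over p in [0, 1/2]."""
    best = -math.inf
    for k in range(grid):
        p = 0.5 * k / (grid - 1)
        h = h2(p)
        value = ((1 - eps) * (1 + h)
                 + lam * (1 - 2 * eps) * h
                 + mu * ((1 - eps) - eps * h))
        best = max(best, value)
    return best


if __name__ == "__main__":
    eps = 0.25
    # Bracketed coefficient negative: (1 - eps) + lam * (1 - 2 * eps) < mu * eps.
    print(d_erasure(0.0, 10.0, eps), (1 + 10.0) * (1 - eps))
    # Bracketed coefficient non-negative: maximum attained at p = 1/2.
    print(d_erasure(1.0, 0.0, eps))
```

In the first call the search returns $(1+\mu)(1-\epsilon)$, matching the expression $(1-\epsilon) + \mu(1-\epsilon)$ in the proof above.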

Lemma 7 The following additivity relation holds for two quantum erasure channels Mf, with the same 
erasure parameter e: 

Proof We prove the non-trivial inequality Dx^f^iMc^^Afc) < D\,fj.i.^e) + D\^^{Afc). We define the following 
states: 

pXAA'.A', ^J2px{x)\x){xf ^ct>^^'^^K 

X 

and we suppose that jg ^j^g state that maximizes D\^n{J\ff®J\f^). Consider the following equality: 

I {AX; B^B2)^ + \I {A)BiB2X)^ + (X; + I {A)B,B2X)J 

= H {A\X)^ + H {B,B2)^ - H {E^E2\X)^ + \{H {B^B^IX)^ - H {E,E2\X)J 
+ f,{H{B,B2)^-H{E,E2\X)J. 

It follows simply by rewriting entropies. We continue below: 

= H {A[A',\X)^ + (1 - ef H {A\A'^)^ + e (1 - e) (if {A',)^ + H (A'^) J 
- 'e-'H {A',A'^\X)^ + e{l-e)(H {A',\X)^ + H (A'^jX) J" 



+ A 
-A 
+ M 



{I - ef H {A\A'2\X)+e{l- c){h {A\\X)^^ + H {A'2\X 



e^H {A',A'^\X)^ + e{l^e)[H (A,\X)^ + H {A^\X) 
(1 - ef H + e{l-e)(H {A',)^ + H {A',)^ 

e'H {A',A',\X)^ + e{l-e){H {A',\X)^ + H (A'^IX) J 



The above equality follows by exploiting the properties of the quantum erasure channel. Continuing, the 
above quantity is less than the following one: 

<2{l-e) + {l-e^)H (^'^ Aj,|X7J), + e (1 - e) (ff K|X7J), + H {A'^\XIJ)^) 

+ \{l-2e)H{A[A'2\XIJ)^ 

+ M [2 (1 - e) - e^H {A,A2\XIJ)„ -e{l-e){H {^,\XIJ)^ - H {AlXIJ)^] 
= 2 (1 - e) + (1 - 6) (if {A[ \XIJ)^ + H (A'^IXIJ)^) + A (1 - 2e) {H {A[ \XIJ)^ + H {A'^\XIJ)^) 
+ M [2 (1 - e) - e (i/ (A'l \XIJ)^ - H (A'^jX/J) J] 
- [(1 - + A (1 - 2e) - /ze^) I {A',; A'^\XIJ)^] 

< £>A,^ m + D^^^ {Af,) - [(1 - e2 + A (1 - 2e) - /xe^) I (A^; A'^jX/J)^" 
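The first inequality follows from proofs similar to those we have seen earlier for a state $\omega$ of the
same form. The first equality follows by rearranging terms, using
$H(A'_1A'_2|XIJ)_\omega = H(A'_1|XIJ)_\omega + H(A'_2|XIJ)_\omega - I(A'_1;A'_2|XIJ)_\omega$. The second inequality follows
from the form of $D_{\lambda,\mu}$ in (34). The final inequality follows because Lemma 6 states that it is
sufficient to consider $(1-\epsilon) + \lambda(1-2\epsilon) \ge \mu\epsilon$. Note that this condition implies that

$$1-\epsilon^2 + \lambda(1-2\epsilon) \ge \mu\epsilon^2$$

(because $1-\epsilon^2 \ge 1-\epsilon$ and $\mu\epsilon \ge \mu\epsilon^2$ for $0 \le \epsilon \le 1$ and $\mu \ge 0$), and hence that the quantity in
square brackets in the line above the last one is non-negative.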
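
The two computational steps that the proof above passes over quickly ("rewriting entropies" and "exploiting the properties of the quantum erasure channel") can be sketched as follows. This is only a supplementary sketch under stated assumptions: the shorthand $B \equiv B_1B_2$, $E \equiv E_1E_2$ and the binary entropy $h_2$ are our notation, and we assume the standard isometric extension of the erasure channel, in which the environment receives the input exactly when the receiver gets the erasure flag.

```latex
% Assumed shorthand (not from the text): B = B_1 B_2, E = E_1 E_2,
% h_2(\epsilon) = -\epsilon\log_2\epsilon - (1-\epsilon)\log_2(1-\epsilon).
%
% Step 1: entropy rewriting. Conditioned on X = x, the state on A B E is pure,
% so H(AB|X) = H(E|X).
\begin{align*}
I(AX;B) &= H(AX) + H(B) - H(AXB) = H(A|X) + H(B) - H(E|X), \\
I(A\rangle BX) &= H(B|X) - H(AB|X) = H(B|X) - H(E|X), \\
I(X;B) + I(A\rangle BX) &= H(B) - H(B|X) + H(B|X) - H(E|X) = H(B) - H(E|X).
\end{align*}
% Purity of each \phi_x^{A A'_1 A'_2} also gives H(A|X) = H(A'_1 A'_2|X).
%
% Step 2: two erasure channels. The four erasure patterns occupy orthogonal
% subspaces, so each output entropy is the flag entropy plus the average
% entropy of the systems that get through.
\begin{align*}
H(B_1B_2) &= 2h_2(\epsilon) + (1-\epsilon)^2 H(A'_1A'_2)
             + \epsilon(1-\epsilon)\left[H(A'_1) + H(A'_2)\right], \\
H(B_1B_2|X) &= 2h_2(\epsilon) + (1-\epsilon)^2 H(A'_1A'_2|X)
             + \epsilon(1-\epsilon)\left[H(A'_1|X) + H(A'_2|X)\right], \\
H(E_1E_2|X) &= 2h_2(\epsilon) + \epsilon^2 H(A'_1A'_2|X)
             + \epsilon(1-\epsilon)\left[H(A'_1|X) + H(A'_2|X)\right].
\end{align*}
% The flag entropy 2 h_2(\epsilon) cancels in every difference that appears
% in the proof, e.g. H(B_1B_2|X) - H(E_1E_2|X) = (1-2\epsilon) H(A'_1A'_2|X).
```

Substituting the Step 2 expressions into the right-hand side of the first equality reproduces the expansion in terms of $A'_1$ and $A'_2$ used in the proof, and the last comment accounts for the coefficient $\lambda(1-2\epsilon)$ that appears after the first inequality.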



11 Conclusion 

We found a purely information-theoretic approach to proving the converse part of the dynamic capacity
theorem. This technique should be simpler to understand for those unfamiliar with the quantum Shannon
theory literature. We also phrased the optimization task for the full dynamic capacity region in terms of
the quantum dynamic capacity formula in (21) and proved its additivity (and hence single-letterization
of the dynamic capacity region) for the quantum Hadamard channels and quantum erasure channels. We
note some open problems below.

There might be room for improvement in our formulas that characterize the dynamic capacity region
when the channel is neither of the Hadamard class nor a quantum erasure channel. Our characterization,
though, has a simple interpretation as the regularization of what one can achieve with the
classically-enhanced father protocol [16] combined with teleportation, super-dense coding, and
entanglement distribution. It is difficult to imagine a simpler characterization than this one, despite
its multi-letter nature in the general case.

We would like to find other channels besides the Hadamard or erasure channels for which the region
single-letterizes. We conjecture that additivity of the quantum dynamic capacity formula in (21) holds
for channels that have hybrid Hadamard-erasure behavior such as the phase erasure channel in Ref. [4].
It would also be interesting to find channels that are not hybrid Hadamard-erasure for which additivity
of this formula holds.

There is also one interesting speculation to muse over that Professor David Avis suggested to us. Does
each of the inequalities in Theorem 1 correspond to some fundamental physical law? An answer might reveal
further connections between information theory and physics that have not yet been elucidated.